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Preface 


Riemannian geometry is characterized, and research is oriented towards and 
shaped by concepts (geodesics, connections, curvature, ...) and objectives, in 
particular to understand certain classes of (compact) Riemannian manifolds 
defined by curvature conditions (constant or positive or negative curvature, 
...). By way of contrast, geometric analysis is a perhaps somewhat less system- 
atic collection of techniques, for solving extremal problems naturally arising 
in geometry and for investigating and characterizing their solutions. It turns 
out that the two fields complement each other very well; geometric analysis 
offers tools for solving difficult problems in geometry, and Riemannian geom- 
etry stimulates progress in geometric analysis by setting ambitious goals. 

It is the aim of this book to be a systematic and comprehensive intro- 
duction to Riemannian geometry and a representative introduction to the 
methods of geometric analysis. It attempts a synthesis of geometric and an- 
alytic methods in the study of Riemannian manifolds. 

The present work is the fourth edition of my textbook on Riemannian 
geometry and geometric analysis. It has developed on the basis of several 
graduate courses I taught at the Ruhr-University Bochum and the University 
of Leipzig. Besides several smaller additions, reorganizations, corrections (I 
am grateful to J. Weber and P.Hinow for useful comments), and a systematic 
bibliography, the main new features of the present edition are a systematic in- 
troduction to Kahler geometry and the presentation of additional techniques 
from geometric analysis. 

Let me now briefly describe the contents: 

In the first chapter, we introduce the basic geometric concepts, like dif- 
ferentiable manifolds, tangent spaces, vector bundles, vector fields and one- 
parameter groups of diffeomorphisms, Lie algebras and groups and in par- 
ticular Riemannian metrics. We also derive some elementary results about 
geodesics. 

The second chapter introduces de Rham cohomology groups and the es- 
sential tools from elliptic PDE for treating these groups. In later chapters, 
we shall encounter nonlinear versions of the methods presented here. 

The third chapter treats the general theory of connections and curvature. 

In the fourth chapter, we introduce Jacobi fields, prove the Rauch com- 
parison theorems for Jacobi fields and apply these results to geodesics. 
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These first four chapters treat the more elementary and basic aspects 
of the subject. Their results will be used in the remaining, more advanced 
chapters that are essentially independent of each other. 

The fifth chapter treats symmetric spaces as important examples of Rie- 
mannian manifolds in detail. 

The sixth chapter is devoted to Morse theory and Floer homology. 

The seventh chapter treats variational problems from quantum field the- 
ory, in particular the Ginzburg-Landau and Seiberg-Witten equations. The 
background material on spin geometry and Dirac operators is already devel- 
oped in earlier chapters. 

In the eighth chapter, we treat harmonic maps between Riemannian ma- 
nifolds. We prove several existence theorems and apply them to Riemannian 
geometry. The treatment uses an abstract approach based on convexity that 
should bring out the fundamental structures. We also display a representative 
sample of techniques from geometric analysis. 

A guiding principle for this textbook was that the material in the main 
body should be self contained. The essential exception is that we use material 
about Sobolev spaces and linear elliptic PDEs without giving proofs. This 
material is collected in Appendix A. Appendix B collects some elementary 
topological results about fundamental groups and covering spaces. 

Also, in certain places in Chapter 6, we do not present all technical details, 
but rather explain some points in a more informal manner, in order to keep 
the size of that chapter within reasonable limits and not to loose the patience 
of the readers. 

We employ both coordinate-free intrinsic notations and tensor notations 
depending on local coordinates. We usually develop a concept in both no- 
tations while we sometimes alternate in the proofs. Besides my not being a 
methodological purist, the reasons for often prefering the tensor calculus to 
the more elegant and concise intrinsic one are the following. For the analytic 
aspects, one often has to employ results about (elliptic) partial differential 
equations (PDEs), and in order to check that the relevant assumptions like 
ellipticity hold and in order to make contact with the notations usually em- 
ployed in PDE theory, one has to write down the differential equation in 
local coordinates. Also, recently, manifold and important connections have 
been established between theoretical physics and our subject. In the physical 
literature, tensor notation is usually employed, and therefore familiarity with 
that notation is necessary to explore those connections that have been found 
to be stimulating for the development of mathematics, or promise to be so 
in the future. 

As appendices to most of the paragraphs, we have written sections with 
the title “Perspectives” . The aim of those sections is to place the material in 
a broader context and explain further results and directions without detailed 
proofs. The material of these Perspectives will not be used in the main body 
of the text. At the end of each chapter, some exercises for the reader are given. 
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IX 


We assume of the reader sufficient perspicacity to understand our system of 
numbering and cross-references without further explanation. 

The development of the mathematical subject of Geometric Analysis, 
namely the investigation of analytical questions arising from a geometric 
context and in turn the application of analytical techniques to geometric 
problems, is to a large extent due to the work and the influence of Shing- 
Tung Yau. This book, like its previous editions, is dedicated to him. 

Jurgen Jost 
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1. Foundational Material 


1.1 Manifolds and Differentiable Manifolds 

A topological space is a set M together with a family O of subsets of M 
satisfying the following properties: 

(i) f?i, 17 2 G O => Q 1 l~l 17 2 G O 

(ii) For any index set A : 

(n a ) aeA cO=> U f 2 a eo 

a&A 

(iii) 0,MeO 

The sets from O are called open. A topological space is called Hausdorff if 

for any two distinct points pi , p 2 £ M there exists open sets 0 \ , 17 2 £ O with 
Pi £ fti,P 2 G 1? 2 jt?i n f? 2 = 0- A covering (f2 a ) a&A (A an arbitrary index 
set) is called locally finite if each p £ M has a neighborhood that intersects 
only finitely many l? a . M is called paracompact if any open covering possesses 
a locally finite refinement. This means that for any open covering (fi a ) ae A 
there exists a locally finite open covering b with 

V /3 £ B 3 a £ A : C fl a . 

A map between topological spaces is called continuous if the preimage of any 
open set is again open. A bijective map which is continuous in both directions 
is called a homeomorphism. 

Definition 1.1.1 A manifold M of dimension d is a connected paracom- 
pact Hausdorff space for which every point has a neighborhood U that is 
homeomorphic to an open subset 17 of R d . Such a homeomorphism 

x : U — > 17 


is called a (coordinate) chart. 

An atlas is a family {U a ,x a } of charts for which the U a constitute an 
open covering of M. 


Remarks. 



2 


1. Foundational Material 


1) A point p £ U a is determined by x a {p)\ hence it is often identified 
with x a (p). Often, also the index a is omitted, and the components 
of x(p) £ are called local coordinates of p. 

2) Any atlas is contained in a maximal one, namely the one consisting 
of all charts compatible with the original one. 

Definition 1.1.2 An atlas {U a ,x Q } on a manifold is called differentiable if 
all chart transitions 

xp o x a . XoXfJot n up) * xp(u a n up) 

are differentiable of class C°° (in case U a fl Up 0) . A maximal differentiable 
atlas is called a differentiable structure, and a differentiable manifold of di- 
mension d is a manifold of dimension d with a differentiable structure. From 
now on, all atlases are supposed to be differentiable. Two atlases are called 
compatible if their union is again an atlas. In general, a chart is called com- 
patible with an atlas if adding the chart to the atlas yields again an atlas. An 
atlas is called maximal if any chart compatible with it is already contained 
in it. 

Remarks. 

1) Since the inverse of xp o xff 1 is x a o xff 1 , chart transitions are differ- 
entiable in both directions, i.e. diffeomorphisms. 

2) One could also require a weaker differentiability property than C°° . 

3) It is easy to show that the dimension of a differentiable manifold is 
uniquely determined. For a general, not differentiable manifold, this 
is much harder. 

4) Since any differentiable atlas is contained in a maximal differentiable 
one, it suffices to exhibit some differentiable atlas if one wants to 
construct a differentiable manifold. 

Definition 1.1.3 An atlas for a differentiable manifold is called oriented if 
all chart transitions have positive functional determinant. A differentiable 
manifold is called orientable if it possesses an oriented atlas. 

It is customary to write the Euclidean coordinates of ft C open, 
as 

x=(x 1 ,...,x d ), (1-1.1) 

and these then are considered as local coordinates on our manifold M when 
x : U — > ft is a chart. 


Example. 
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1) The sphere S n := {(cc\ . . . , x n+1 ) G R" +1 : S (a;*) 2 = 1} is a 

i = 1 

differentiable manifold of dimension n. Charts can be given as follows: 
On XJ\ := 5 n \{(0, . . . , 0, 1)} we put 

hix 1 , ■■ • • , x n+1 ) := (. fl(x \ . . . , x n+1 ), . . . , fiix 1 ,. . . , x n+1 )) 
f x 1 x™ \ 

•“ y 1 — x n+1 ’ ‘ ‘ ’ 1 — x n+1 ) 

and on U 2 := S' n \{(0, . . . , 0, —1)} 

/ 2 (:r\ . . . , a: n+1 ) := (/a (a: 1 , . . . , x n+1 ), f?(x \. . . , x n+1 )) 

/ x 1 x n \ 

\l + a: ri + 1 ’‘'‘’ 1 + x n+1 ) ' 

2) Let wi,w 2 , ■ ■ ■ ,w n G R" be linearly independent. We consider 
zi,z 2 G R" as equivalent if there are mi , m 2 , • • • , rri n G Z with 

n 

Zl - z 2 = ^2 m i W i 
2=1 

Let 7 r be the projection mapping z G R n to its equivalence class. The 
torus T n := 7r(R") can then be made a differentiable manifold (of 
dimension n) as follows: Suppose A a is open and does not contain 
any pair of equivalent points. We put 

U a := n (A*) 

3) The preceding examples are compact. Of course, there exist also non- 
compact manifolds. The simplest example is R d . In general, any open 
subset of a (differentiable) manifold is again a (differentiable) mani- 
fold. 

4) If M and N are differentiable manifolds, the Cartesian product M x 
N also naturally carries the structure of a differentiable manifold. 
Namely, if {U a ,x a } aG A and { Vjj, ypjpzB are atlases for M and N, 
resp., then {U a x Vp, (x a , yp)}( a ,p)eAxB is an atlas for M x N with 
differentiable chart transitions. 

Definition 1.1.4 A map h : M M' between differentiable manifolds M 
and M' with charts {U a ,x a } and {U' al x' a } is called differentiable if all maps 
x'poho x~ x are differentiable (of class C °° , as always) where defined. 

Such a map is called a diffeomorphism if bijective and differentiable in both 
directions. 
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For purposes of differentiation, a differentiable manifold locally has the 
structure of Euclidean space. Thus, the differentiability of a map can be 
tested in local coordinates. The diffeomorphism requirement for the chart 
transitions then guarantees that differentiability defined in this manner is a 
consistent notion, i.e. independent of the choice of a chart. 

Remark. We want to point out that in the context of the preceding def- 
initions, one cannot distinguish between two homeomorphic manifolds nor 
between two diffeomorphic differentiable manifolds. 

When looking at Definitions 1.1.2, 1.1.3, one may see a general pattern 
emerging. Namely, one can put any type of restriction on the chart transitions, 
for example, require them to be affine, algebraic, real analytic, conformal, 
Euclidean volume preserving,..., and thereby define a class of manifolds with 
that particular structure. Perhaps the most important example is the notion 
of a complex manifold. We shall need this, however, only at certain places in 
this book, namely in 5.1, 5.2. 

Definition 1.1.5 A complex manifold of complex dimension d (dimc-M = d) 
is a differentiable manifold of (real) dimension 2d (diniRM = 2d) whose charts 
take values in open subsets of C d with holomorphic chart transitions. 

In the case of a complex manifold, it is customary to write the coordinates 
of C d as 

z = (z 1 , . . . , z d ), with z 3 = x 3 + iy 3 , (1.1.2) 

with i ■= %/— T> that is, use (x 1 ,?/ 1 ,. . . ,x d ,y d ) as Euclidean coordinates on 
R 2d . We then also put 

z 3 := x 3 — iy 3 . 

The requirement that the chart transitions zpoz~ l : z a (U a nUp) — > zp(U a C 1 
Up) be holomorphic then is expressed as 

dzl 
dz« 

for all j, k where 

d _ 1 / d . d 
dz ^ 2 \dx^ dyk 

We also observe that a complex manifold is always orientable because 
holomorphic maps always have a positive functional determinant. 

We conclude this section with a useful technical result. 

Lemma 1.1.1 Let M be a differentiable manifold, (U a )\aeA cm open cover- 
ing. Then there exists a partition of unity, subordinate to ( U a ). This means 


(1.1.4) 
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that there exists a locally finite refinement (Vp)p e B of (U a ) and C£° (i.e. C°° 
functions ipp with {x £ M : <pp(x) 0} having compact closure) functions 
ipp : M — > R. with 

(i) suppipp C Vp for all (3 £ B. 

(ii) 0 < y>p(x) < 1 for all x £ M,f3 £ B. 

(Hi) E fp{x) = 1 for all x £ M. 

(3eB 

Note that in (Hi), there are only finitely many nonvanishing summands at 
each point since only finitely many (pp are nonzero at any given point because 
the covering (Vp) is locally finite. 

Proof. See any advanced textbook on Analysis, e.g. J.Jost, Postmodern Anal- 
ysis, 3rd ed., Springer, 2005. □ 


Perspectives. Like so many things in Riemannian geometry, the concept of a 
differentiable manifold was in some vague manner implicitly contained in Bern- 
hard Riemann’s habilitation address “Uber die Hypothesen, welche der Geometrie 
zugrunde liegen”, reprinted in [254]. The first clear formulation of that concept, 
however, was given by H. Weyl[252j. 

The only one dimensional manifolds are the real line and the unit circle S 1 , 
the latter being the only compact one. Two dimensional compact manifolds are 
classified by their genus and orientability character. In three dimensions, there exists 
a program by Thurston[243, 244] about the possible classification of compact three- 
dimensional manifolds. References for the geometric approach to this classification 
will be given in the Survey on Curvature and Topology after Chapter 4 below. - In 
higher dimensions, the plethora of compact manifolds makes a classification useless 
and impossible. 

In dimension at most three, each manifold carries a unique differentiable struc- 
ture, and so here the classifications of manifolds and differentiable manifolds coin- 
cide. This is not so anymore in higher dimensions. Milnor[180, 181] discovered exotic 
7-spheres, i.e. differentiable structures on the manifold S' that are not diffeomorphic 
to the standard differentiable structure exhibited in our example. Exotic spheres 
likewise exist in higher dimensions. Kervaire[156] found an example of a manifold 
carrying no differentiable structure at all. 

In dimension 4, the understanding of differentiable structures owes important 
progress to the work of Donaldson. He defined invariants of a differentiable 4- 
manifold M from the space of selfdual connections on principal bundles over it. 
These concepts will be discussed in more detail in §3.2. 

In particular, there exist exotic structures on ]R 4 . A description can e.g. be 
found in [79]. 
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1. Foundational Material 


1.2 Tangent Spaces 


Let x = (a: 1 ,.. . . , x d ) be Euclidean coordinates of R d , 17 C R d open, xo £ 17. 
The tangent space of 17 at the point Xq, 


T Xq Q 


is the space {xo} x E, where E is the d-dimensional vector space spanned by 
the basis g|r , . . . , . Here, g|r , . . . , g §3 are the partial derivatives at the 

point xo- If 17 C R d , 17' C R c are open, and / : 17 — > 17' is differentiable, we 
define the derivative df{x o) for xo £ 17 as the induced linear map between 
the tangent spaces 


df{x 0 ) 


'■T Xo ^ Tf( x 0 )l7 


d dfj 


d 

w 


Here and in the sequel, we use the Einstein summation convention: An index 
occuring twice in a product is to be summed from 1 up to the space dimension. 
Thus, v l is an abbreviation for 


E 


v 


i_d_ 
dx' 1 ’ 


idfl 

dx % 


d 

dp 


stands for 


d c 


EE 


dp d 
dx i dft 


In the previous notations, we put 


T17 := 17 x E ^ 17 x R d . 


Thus, T17 is an open subset of R d x R d , hence in particular a differentiable 
manifold. 

7r : T17 — > 17 (projection onto the first factor) 

(x, v) X 

is called a tangent bundle of 17. T17 is called the total space of the tangent 
bundle. 
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Likewise, we define 


df :Tfl -► Tfl' 

Instead of 

df(x,v) 

we write 

df{x)(v). 

If in particular, / : ft — > M is a differentiable function, we have for v = v' ,jd 
df(x)(v) = v i ^- i (x)eT f{x) R^R. 


In this case, we often write v(f)(x) in place of df(x)(v) when we want to 
express that the tangent vector v operates by differentiation on the function 
/• 

Let now M be a differentiable manifold of dimension d, and p £ M. We 
want to define the tangent space of M at the point p. Let x : U —> R d be 
a chart with p £ U, U open in M. We say that the tangent space T p M is 
represented in the chart x by T x ( p yx(U). Let x' : U' —> R d be another chart 
with p £ U' , U' open in M. fl := x(U), ft' := x'{U'). The transition map 


x' ox 1 : x(U D U') — > x'(U fl U') 


induces a vector space isomorphism 

L := d(x' o x _1 )(a:(p)) : T x(p) i7 -> T x , {p) f2' . 

We say that v £ T x uyQ and L[v) £ T x y p )ff represent the same tangent 
vector in T p M. Thus, a tangent vector in T p M is given by the family of its 
coordinate representations. This is motivated as follows: Let / : M — ■> R. 
be a differentiable function. Assume that the tangent vector w £ T p M is 
represented by v € T x ^x(U). We then want to define df(p) as a linear map 
from T p M to R. In the chart x, let w £ T p M be represented by v = £ 

T x ( p )x(U). We then say that 

df(jp)(w) 

in this chart is represented by 

Now 

d(f o a: -1 )(;r(p))(v) = d(f o x '~ l oi'o x _1 )(a;(p))(u) 

= d(f o x'- 1 )^' (p))(L(y)) 
by the chain rule 

= d(f o a; ,_1 )(a; , (p)) o d( x' o a: _1 )(a;(p))(t)) 
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Thus, in the chart x ' , w is represented by L(v). Here, a fundamental idea 
emerges that will be essential for the understanding of the sequel. T p M is 
a vector space of dimension d, hence isomorphic to This isomorphism, 
however, is not canonical, but depends on the choice of a chart. A change of 
charts changes the isomorphism, namely at the point p by the linear trans- 
formation L = d(x' ox~ 1 )(x(p)). Under a change of charts, also other objects 
then are correspondingly transformed, for example derivatives of functions, 
or more generally of maps. In other words, a chart yields local representations 
for tangent vectors, derivatives, etc., and under a change of charts, these lo- 
cal representations need to be correctly transformed. Or in still other words: 
We know how to differentiate (differentiable) functions that are defined on 
open subsets of R d . If now a function is given on a manifold, we pull it back 
by a chart, to an open subset of and then differentiate the pulled back 
function. In order to obtain an object that does not depend on the choice of 
chart, we have to know in addition the transformation behavior under chart 
changes. A tangent vector thus is determined by how it operates on functions 
by differentiation. 

Likewise, for a differentiable map F : M — > N between differentiable 
manifolds, dF is represented in local charts x : U C M — > R d , y : V C N — > 
R c by 

d(y o F o aA 1 ). 

In the sequel, in our notation, we shall frequently drop reference to the charts 
and write instead of d(y of oi _1 ) simply dF , provided the choice of charts 
or at least the fact that charts have been chosen is obvious from the context. 
We can achieve this most simply as follows: 

Let the local coordinates on U be 

(a: 1 , . . . , x d ), 

and those on V be (F 1 , . . . , F c ). We then consider F(x) as abbreviation for 
(F\x\...,x d ),...,F c (x\...,x d )). 
dF now induces a linear map 

dF : T X M — > T F ( X )N, 

which in our coordinates is represented by the matrix 



A change of charts leads to a base change of the tangent spaces, and the 
transformation behavior is determined by the chain rule. If 

(a; 1 , . . . , x d ) i ^ (£\...,£ d ) 
and (F 1 , . , F c ) (& 1 , . . . , <P C ) 
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are coordinate changes, then dF is represented in the new coordinates by 

(d$P\_ dF a OF \ 

\ d£ 'i ) \ dF a dx i ) 

Note that the functional matrix of the coordinate change of the image N, but 
the inverse of the functional matrix of the coordinate change of the domain M 
appears here. We also remark that for a function ip : N — » K and ate T X M 

{dF(v){' P )){F{x)) : = dip(dF(v))(F(x)) 

by definition of the application of 
dF(v ) £ Tp( x )N to Lp : N — > R. 

= d(<p o F)(v)(x) by the chain rule 
= v(<p o F)(x) by definition of the application of 
v £ T X M to ip o F : M — >■ R 

Instead of applying the tangent vector dF{v ) to the function, one may also 
apply the tangent vector v to the “pulled back” function ip o F. 

We want to collect the previous considerations in a formal definition: 

Definition 1.2.1 Let p £ M. On {(x,v) : x : U — > Q chart with p £ U,v £ 
T x (p )f?} (x,v) ~ ( y,w ) : w = d{y o x~ l )v. The space of equivalence 

classes is called the tangent space to M at the point p, and it is denoted by 
T p M. 

T p M naturally carries the structure of a vector space: 

The equivalence class of Ai(a;,ui) + \ 2 {x,V 2 ) (Ai,A 2 £ R) is the one 
of (x, AiUi + A 2 U 2 ). We now want to define the tangent bundle of a differ- 
entiable manifold of dimension d. TM is the disjoint union of the tangent 
spaces T p M,p £ M, equipped with the following structure of a differentiable 
manifold: First let n : TM — > M with n(w) = p for w £ T p M be the projec- 
tion onto the “base point” . If x : U — » is a chart for M, we let TU be the 
disjoint union of the T p M with p £ U and define the chart 

dx:TU -> Tx{U) (:= (J T p M) 

pGx(U) 

where Tx(U) carries the differentiable structure of x(U) x 
w i-> dx(n(w))(w) £ T x („( w ))x(U). 

The transition maps 

dx' o ( dx ) _1 = d(x r ox -1 ) 
then are differentiable, tt is locally represented by 

x o 7r o dx^ 1 
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and this map maps (xo,v) £ Tx(U) to Xq. 

Definition 1.2.2 The triple (TM,n,M) is called the tangent bundle of M, 
and TM is called the total space of the tangent bundle. 


Finally, we briefly discuss the case of a complex manifold M, to have it 
at our disposal in 5.2. With the previous constructions and conventions in 
the real case understood, we let z J = x J + iy 1 again be local holomorphic 
coordinates near 2 : £ M, as at the end of 1.1. TfM := T Z M is the ordinary 
(real) tangent space of M at z, and 

TfM ■- TfM <g> K C 

is the complexified tangent space which we then decompose as 

TfM = <C{ — , =: T'M © TfM 

z dzi dzi 

where T' Z M = C{g^j} is the holomorphic and TfM = the antiholo- 

morphic tangent space. In TfM , we have a conjugation, mapping ff to ,ff , 
and so, TfM = T' Z M. The projection TfM — > TfM — » T' Z M is an K-linear 
isomorphism. 


Perspectives. Other definitions of the tangent space of a differentiable manifold 
M are possible that are more elegant and less easy to compute with. 

A germ of a function at x £ M is an equivalence class of smooth functions 
defined on neighborhoods of x, where two such functions are equivalent if they 
coincide on some neighborhood of x. A tangent vector at x may then be defined as 
a linear operator 5 on the function germs at x satisfying the Leibniz rule 

S(f ■ g)(x) = ( Sf{x))g(x ) + f(x)6g(x). 

This definition has the obvious advantage that it does not involve local coordinates. 


1.3 Submanifolds 

A differentiable map / : M — > N is called an immersion , if for any x £ M 

df : T X M T f(x) N 

is injective. In particular, in this case m := dim M < n := dim N. If an 
immersion / : M — » IV maps M homeomorphically onto its image in N, f is 
called differentiable embedding. The following lemma shows that locally, any 
immersion is a differentiable embedding: 
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Lemma 1.3.1 Let f : M — > N be an immersion, dimM = m,dimiV = 
n, x € M. Then there exist a neighborhood U of x and a chart (V, y) on N 
with f(x) £ V , such that 

(i) fju is a differentiable embedding, and 

(a) y m+1 {p) = ■ ■ ■ = y n (p) = 0 for all p e f(U) n V. 


Proof. This follows from the implicit function theorem. In local coordinates 
(z 1 ,. . . , z n ) on N, (x 1 ,. . . , x m ) on M let, w.l.o.g. (since df(x) is injective) 

f dz a (f(x)) \ 

V dxi ) 

be nonsingular. 

We consider 

F(z,x) := (z 1 -f 1 (x),...,z n -f n (x)), 

which has maximal rank in x 1 , . . . , x m , z m+1 , . . . ,z n . By the implicit function 
theorem, there locally exists a map 

(z 1 , . . . , z m ) ~ (^(z 1 , . . . , z m ), . . . , <p n (z\ ■ • ■ , ^ m )) 

with 

F(z,x) = 0 ^ x 1 = <p\z\ . . . ,z m ), . . . ,x m = ■ • ,z m ), 

z m+1 = T m+ \z\ . . . , z m ), ...,z n = V n (z\ z m ), 


for which has maximal rank. 

As new coordinates, we now choose 




z m+l _ ^+l (z l 7 . . . , *">), , *"*)). 


Then 

2 = f(x) 

<f=> F(z, x) = 0 

^ ( y 1 ,---,y n ) = {x 1 ,- ■ ■ ,x m ,o, . . . ,o), 

and the claim follows. □ 


If f : M — > N is a differentiable embedding, f{M) is called a differentiable 
submanifold of N. A subset N' of N , equipped with the relative topology, thus 
is a differentiable submanifold of N, if N' is a manifold and the inclusion is 
a differentiable embedding. 
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Charts on N' then are simply given by restrictions of charts of N to 
N' , and Lemma 1.3.1 shows that one may here always find a particularly 
convenient structure of the charts. 

Similarly, the implicit function theorem implies 

Lemma 1.3.2 Let f : M — > N be a differentiable map, dimM = m, dim TV = 
n,m > n, p £ N. Let df{x) have rank n for all x £ M with f(x) = p. Then 
f~ 1 (p) is a union of differentiable submanifolds of M of dimension m — n. 


Proof. We again represent the situation in local coordinates around x £ M 
and p = f(x) £ N. Of course, in these coordinates df(x) still has rank n. By 
the implicit function theorem, there exist an open neighborhood U of x and 
a differentiable map 


g(x n+1 , . . . , x m ) : U 2 C R m " n 


with 


U = U 1 xU 2 


f(x) = P <*=► (x\...,x n )=g(x n+ 1 ,...,x m ). 

With 

y a — x a — g(x n+1 ,...,x m ) for a = 1 

y s = X s for s = n + 1 , . . . , m 

we then get coordinates for which 


f(x) = p <==>■ y a = 0 for a = 1, . . . , n. 

( y n+1 , . . . ,y m ) thus yield local coordinates for {f(x) = p} and this implies 
that in some neighborhood of x {f(x) = p} is a submanifold of M of dimen- 
sion m — n. □ 


Let M be a differentiable submanifold of N, and let i : M — > N be 
the inclusion. For p £ M, T p M can then be considered as subspace of T p N, 
namely as the image di(T p M). 

The standard example is the sphere 

S n = {x £ K n+1 : |s| = 1} C M” +1 . 

By the Lemma 1.3.2, S n is a submanifold of M” +1 . 

Lemma 1.3.3 In the situation of Lemma 1.3.2, we have for the submanifold 
X = / -1 (p) and q £ X 


T q X= ker df(q)cT q M. 
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Proof. Let v £ T q X , (ip, U ) a chart on X with q £ U. Let 7 be any smooth 
curve in <p(U) with 7(0) = <p(q), 7(0) := ^7(t) |t=o = dip(v), for example 
7 (t) = <p(q) + tdtp(v). c := <p _1 (7) then is a curve in X with c( 0) = u. 
Because of X = f~ 1 (p), 

f o c(t) = p V t, 

hence d/(g) o c(0) = 0, and consequently v = c(0) £ ker df(q). Since also 
T q X = dim ker df{q) = m — n, the claim follows. □ 

For our example S n , we may choose 

/ : K” +1 — > R, f{x) = \x\ 2 . 

Then 

T x S n = ker df(x) = {u £ K n+1 : x ■ v(= xW) = 0}. 


Perspectives. H. Whitney (1936) showed that any d-dimensional differentiable 
manifold can be embedded into R 2d+1 . Thus, the class of abstract differentiable 
manifolds is the same as the class of submanifolds of Euclidean space. Neverthe- 
less, the abstract and intrinsic point of view offers great conceptual and technical 
advantages over the approach of submanifold geometry of Euclidean spaces. 


1.4 Riemannian Metrics 

We now want to introduce metric structures on differentiable manifolds. 
Again, we shall start from infinitesimal considerations. We would like to be 
able to measure the lengths of and the angles between tangent vectors. Then, 
one may, for example, obtain the length of a differentiable curve by integra- 
tion. In a vector space such a notion of measurement is usually given by a 
scalar product. We thus define 

Definition 1.4.1 A Riemannian metric on a differentiable manifold M is 
given by a scalar product on each tangent space T p M which depends smoothly 
on the base point p. A Riemannian manifold is a differentiable manifold, 
equipped with a Riemannian metric. 

In order to understand the concept of a Riemannian metric, we again need 
to study local coordinate representations and the transformation behavior of 
these expressions. 

Thus, let x = (a: 1 , . . . , x d ) be local coordinates. In these coordinates, a 
metric is represented by a positive definite, symmetric matrix 


(dij 
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(i.e. g Z j = g jl for all i,j, > 0 for all £ = (£\ . . . , f d ) ^ 0 ), where the 

coefficients depend smoothly on x. The transformation formula (1.4.3) below 
will imply that this smoothness does not depend on the choice of coordinates. 
Therefore, smooth dependence on the base point as required in def. 1.4.1 can 
be expressed in local coordinates. 

The product of two tangent vectors v, w € T p M with coordinate repre- 
sentations (v 1 , . . . ,v d ) and (w 1 , . . . ,w d ) (i.e. v = = wdg|y) then 

is 

(v,w) := gij(x(p))v z w 3 . (1.4.1) 

In particular, (gfi, gfj) = 9ij- 

Similarly, the length of v is given by 

IMI := {v,v)i. 

We now want to study the transformation behavior. Let y = f(x) define 
different local coordinates. In these coordinates, v and w have representations 
( 0 1 , . . . , v d ) and (w , . . . , w d ) with & = Let the metric in 

the new coordinates be given by hkffy). 

It follows that 

hke(f(x))v k w e = (v,w) = gij(x)v l w J , (1-4.2) 

hence 

df k df^ - 

fyw(/(z))^^yU*«d = g.ij(x)v l w J , 
and since this holds for all tangent vectors v,w, 

= 9ij(x). (1-4.3) 

Formula (1.4.3) gives the transformation behavior of a metric under co- 
ordinate changes. 

The simplest example of a Riemannian metric of course is the Euclidean 
one. For v = (v 1 , . . . , v d ), w = (w 1 ,...,w d ) € T x M. d , the Euclidean scalar 
product is simply 

c i 7 i i 

OijV W J = V w , 

where 

c / 1 for i = j 
(0 for i-ff j 

is the standard Kronecker symbol. 

Theorem 1.4.1 Each differentiable manifold may be equipped with a Rie- 
mannian metric. 

Proof. Let {(x a ,U a ) : a € A} be an atlas, (ip a )a&A a partition of unity 
subordinate to ( U a ) a ^A (see Lemma 1.1.1) (for simplicity of notation, we use 
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the same index set for (<p a ) and (U a ); this may be justified by replacing the 
original covering ( U a ) by a locally finite refinement). 

For v, w £ T p M and a € A with p £ U a let the coordinate representations 
be (v„, . . . , v d ) and (w^, . . . , w d ). Then we put 

(v,w):= ^a0)«- 

oEi 

withpet/c 

This defines a Riemannian metric. (The metric is simply obtained by 
piecing the Euclidean metrics of the coordinate images together with the 
help of a partition of unity.) □ 

Let now [a, b] be a closed interval in R, 7 : [a, 6] — > M a smooth curve, 
where “smooth”, as always, means “of class C°°” . 

The length of 7 then is defined as 

b 

L{l) ■= J '^{0 \<» 

a 

and the energy of 7 as 

a 

(In physics, ^(7) is usually called “action of 7” where 7 is considered as the 
orbit of a mass point.) Of course, these expressions can be computed in local 
coordinates. Working with the coordinates (x 1 (7(f)), • ■ • ,x d ('y(t))) we use the 
abbreviation 

A t ) := m- 

Then 

b 

L{ 7) = J \Jg ij {x{'y(t)))x i {t)xi(t)dt 

a 

and 

b 

E (7) = ^J gij(x('l(t)))x l (t)x J (t)dt. 

a 

We also remark for later technical purposes that the length of a (continuous 
and) piecewise smooth curve may be defined as the sum of the lengths of the 
smooth pieces, and the same holds for the energy. 

On a Riemannian manifold M, the distance between two points p , q can 
be defined: 
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d(p,q) := inf {£(7) :j : [a, 6] — > M piecewise smooth curve with 

7(a) = P, 7(b) = <?}- 

We first remark, that any two points p,q £ M can be connected by a 
piecewise smooth curve, and d(p, q) therefore is always defined. Namely, let 

E p '■= {q € M : p and q can be connected by a 
piecewise smooth curve.} 

With the help of local coordinates one sees that E p is open. But then also 
M\E p = U is open. Since M is connected and E p ^ $ (p £ E p ), we 

q£E p 

conclude M = E p . 

The distance function satisfies the usual axioms: 

Lemma 1.4.1 

(i) d{p, q) > 0 for all p , q, and d(p , q) > 0 for all p ^ q 

(ii) d(p,q) = d(q,p) 

(Hi) d(p , q) < dip, r) + dir, q) (triangle inequality) for all points p,q,r £ 

M. 

Proof, (ii) and (iii) are obvious. For (i), we only have to show d(p,q) > 0 for 
p yf q. For this purpose, let x : U — > be a chart with p £ U. Then there 

exists e > 0 with 

D e ix(p)) := {y £ R d : \y - xip) \ < e} C z(f7) 

(the bars denote the Euclidean absolute value) and 

q x^iDzixip))). (1-4.4) 

Let the metric be represented by (gij(x)) in our chart. Since (<77(2)) is pos- 
itive definite and smooth, hence continuous in x and H e (a;(p)) is compact, 
there exists A > 0 with 

9ii(v)et J > A l^| 2 (1-4-5) 

for all y £ D e ix(p)),f = (^ 1 ,-.-,^ d ) € Therefore, for any curve 7 : 
[a, b] — > M with 7(a) = p, jib) = q 

Li 7 ) > LijCix~ 1 iD e ixip))) 

> Ae > 0, (1.4.6) 

because x(j) by (1-4.4) has to contain a point z £ dD s (xip)), i.e. a point 
whose Euclidean distance from xip) is e. By (1.4.5), z then has distance from 
x(p) at least Ae w.r.t. the metric (37). □ 
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Corollary 1.4.1 The topology on M induced by the distance function d co- 
incides with the original manifold topology of M. 

Proof. It suffices to show that in each chart the topology induced by d co- 
incides with the one of K d , i.e. the one induced by the Euclidean distance 
function. Now for every x in some chart, there exists e > 0 for which D e (x) 
is contained in the same chart, and positive constants A, p with 

A|£| 2 < gij(y)C € 3 < At|C| 2 for all y G D e {x),£ G R d . 

Thus 

A| y — x\< d(y, x) < y\y - x\ for ally G D e (x), 

and thus each Euclidean distance ball contains a distance ball for d, and vice 
versa (with 

B(z,S) := {y G M : d{z,y) < 5} 

we have 

D\s{x) C B(x,6) C D^{x), if < e). □ 


We now return to the length and energy functionals. 

Lemma 1.4.2 For each smooth curve 7 : [a, b } — > M 

L( 7 ) 2 <2(&-a)£( 7 ), (1.4.7) 

and equality holds if and only = const. 

Proof. By Holder’s inequality 



with equality precisely if ||^|| = const. □ 

Lemma 1.4.3 If j : [a, b] — > M is a smooth curve, and ip : [a,/?] — » [a, b] is 
a change of parameter, then 

1,(70 ip) = L( 7 ). 


Proof. Let t = ^(r). 

P / 2\ \ 

L(joip) = J(g ij (x{^(ip{T))))P(ip(T))x J {ip{T))(^ffj j dr 

a ' ' 

(by the chain rule) 

= i( 7 ) (by change of variables). 


□ 
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Lemma 1.4.4 The Euler-Lagrange equations for the energy E are 


x l (t) + r l A k (x{t))x-i (t)x k (t) = 0, i = l,...,d (1.4.8) 


with 

where 

and 


Fjk — 2 ^ (9jt,k T 9ki,j 9jk,e), 


= ( 9ij ) 1 (i-e. g lt gtj = <%) 


9iW ~ q x i ; 9 j t- 


The expressions Tf k are called Christoffel symbols. 
Proof. The Euler-Lagrange equations of a functional 

b 


are given by 


I{x) = J f(t,x(t),x(t))dt 


±di di 

dt dx i dx i ’ ’ ’ 


In our case, we therefore get (E( 7) = \J gjk{x{t))x^x k dt) 

^{gik(x(t))x k (t) + gji{x(t))x J (t)) - g jk ,i(x(t))x J (t)x k (t) = 0 fori = 1, . . . ,d, 
hence 

gikX k + gjiX° + gik/x e x k + gji,ex e x J - g jkti x J x k = 0. 

Renaming some indices and using the symmetry gi k = g k i, we get 

^gemX T (gek,j T 9 j£,k gjk,e)i j x 0 L 1, • • • > d (1.4.9) 

and from this 

9 9 £mX T -g (gtk,j + 9j£,k 9 jk.£)x^ x ' 0 i — 1, . . . , d. 

Because of 

g l( gtm = S im , and thus g lt ge m x m = x l 
we obtain (1.4.8) from this. □ 

Definition 1.4.2 A smooth curve 7 = [a, b\ — ■> M, which satisfies (with 

**(*) = T'W) etc -) 

x z (t) + r^ k (x{t))xP (t)x k (t) = 0 for * = 1 , . . . , d 
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is called a geodesic. 

Thus, geodesics are the critical points of the energy functional. By Lemma 
1.4.3, the length functional is invariant under parameter changes. As in the 
Euclidean case, one easily sees that regular curves can be parametrized by arc 
length. We shall attempt to minimize the length within the class of regular 
smooth curves, and we shall succeed and complete the program in Corollary 
1.4.2 below. As the length is invariant under reparametrization by Lemma 
1.4.3, therefore, if one seeks curves of shortest length, it suffices to consider 
curves that are parametrized by arc length. For such curves, by Lemma 1.4.2 
one may minimize energy instead of length. Conversely, every critical point 
of the energy functional, i.e. each solution of (1.4.8), i.e. each geodesic, is 
parametrized proportionally to arc length. 

Namely, for a solution of (1.4.8) 

J t ^^) = J t ( 9 ij{x(t))x l {t)x 3 (t)) 

= gijX l x 3 + giji: l x 3 + gij t kx l x 3 x k 
= + ge jt k - gtk,j)x e x k x 3 + ge jtk x k x e x 3 

by formula (1.4.9) which is equivalent to (1.4.8) 

= 0, since gj k /x e x k x 3 = gi k ,jx e x k x 3 

by interchanging the indices j and £ 

Consequently ( x,x ) = const., and hence the curve is parametrized propor- 
tionally to arc length. We have shown 

Lemma 1.4.5 Each geodesic is parametrized proportionally to arc length. 

□ 

Theorem 1.4.2 Let M be a Riemannian manifold, p £ M,v £ T p M. Then 
there exist e > 0 and precisely one geodesic 

c : [0, e] -» M 

with c(0) = p, c(0) = v. In addition, c depends smoothly on p and v. 

Proof. (1.4.8) is a system of second order ODE, and the Picard-Lindelof The- 
orem yields the local existence and uniqueness of a solution with prescribed 
initial values and derivatives, and this solution depends smoothly on the data. 

□ 

We note that if x(t) is a solution of (1.4.8), so is x(Xt) for any constant 
Agl. Denoting the geodesic of Theorem 1.4.2 with c(0) = p, c(0) = v by c„, 
we obtain 

c v {t) = c A „(-r) for A > 0,< G [0,e]. 

A 
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In particular, c\ v is defined on [0, f]. 

Since c v depends smoothly on v, and {v € T p M : ||v|| = 1} is compact, 
there exists eo > 0 with the property that for ||v| = 1 c v is defined at least 
on [0,£o]. Therefore, for any w £ T p M with ||w|| < e 0: c w is defined at least 
on [0,1]. 

Definition 1.4.3 Let M be a Riemannian manifold, p £ M, 

V p :={v £ T p M : c v is defined on [0, 1]} 
exp p :V p — > M 
v c„(l) 

is called the exponential map of M at p. 

By the preceding considerations, the domain of definition of the expo- 
nential map always at least contains a small neighborhood of 0 € T p M. In 
general, however, V p is not all of T p M, as is already seen in the example of a 
proper, open subset of K d , equipped with the Euclidean metric. Nevertheless, 
we shall see in Theorem 1.4.7 below that for a compact Riemannian manifold, 
exp p can be defined on all of T p M. 

Theorem 1.4.3 The exponential map exp p maps a neighborhood, of 0 € T p M 
diffeomorphically onto a neighborhood of p £ M. 

Proof. Since T p M is a vector space, we may identify T 0 T p M, the tangent 
space of T p M at 0 £ T p M , with T p M itself. The derivative of exp p at 0 then 
becomes a map from T p M onto itself: 

dexp p (0) : T p M —> T p M. 

With this identification of TqT p M and T p M , for v £ T p M 
dexp p (0)O) = fffCtv ( 1) |t— o 

= ^(% =0 

= c„(0) 

= V 

Hence 

dexp p (0) = id\ TpM - (1.4.10) 

In particular, dexp p (0) has maximal rank, and by the inverse function 
theorem, there exists a neighborhood of 0 £ T p M which is mapped diffeo- 
morphically onto a neighborhood of p £ M. □ 
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Let now ei, e 2 , . . . , (d = dim M) be a basis of T p M which is orthonor- 
mal w.r.t. the scalar product on T p M defined by the Riemannian metric. 
Writing for each vector v £ T p M its components w.r.t. this basis, we obtain 
a map 

<Z> : T p M -> 

v = v l e.t i— > (v 1 , . . . , v d ). 

For the subsequent construction, we identify T p M with via <P. By The- 
orem 1.4.3, there exists a neighborhood U of p which is mapped by exp~ 1 
diffeomorphically onto a neighborhood of 0 £ T p M, hence, with our identi- 
fication T p M = diffeomorphically onto a neighborhood 17 of 0 £ R d . In 
particular, p is mapped to 0. 

Definition 1.4.4 The local coordinates defined by the chart (exp” 1 ,!/) are 
called (Riemannian) normal coordinates with center p. 

Theorem 1.4.4 In normal coordinates, we have for the Riemannian metric 

9ij(0)=S ij (1.4.11) 

Tj k ( 0) = 0 (and also gij tk (0) = 0) for all i,j, k . (1.4.12) 

Proof. (1.4.11) directly follows from the fact that the above identification 
I? : T p M = maps an orthonormal basis of T p M w.r.t. the Riemannian 
metric onto an Euclidean orthonormal basis of R d . 

For (1.4.12), we note that in normal coordinates, the straight lines through 
the origin of (or, more precisely, their portions contained in the chart 
image) are geodesic. Namely, the line tv,t £ R,u £ R d , is mapped (for suffi- 
ciently small t) onto Ct„(l) = c v (t), where c v {t) is the geodesic, parametrized 
by arc length, with c„(0) = v. 

Inserting now x{t) = tv into the geodesic equation (1.4.8), we obtain 
because of x(t) = 0 

q k (tv)viv k = 0, fori = 1, . . . , d. (1.4.13) 

In particular at 0, i.e. for t = 0, 

q k (0)v j v k = 0 for allv € R d , i = 1, . . . , d . (1.4.14) 

We put v = \(ei + e m ) and obtain because of the symmetry T l - k = T k - 

Tfmi 0) = 0 for alii. 

Since this holds for all £,m, all Tj fc ( 0) vanish. By definition of T l - k , we obtain 
at 0 G R d 

9 l( {9je,k + 9 m, j ~ 9jk,£ ) = 0 V i,j, k, 


hence also 
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9jm,k T 9km, j 9jk,m — 0 ^ j) ^5 t/t . 

Adding now the relation (obtained by cyclic permutation of the indices) 

9k j ,m H - 9mj,k 9km, j — O 7 

we obtain (with gkj = gjk ) 

9jm,k(°) = 0, for all j,k,m. □ 


Later on (in Chapter 3), we shall see that in general the second derivatives 
of the metric cannot be made to vanish at a given point by a suitable choice 
of local coordinates. The obstruction will be given by the curvature tensor. 

Further properties of Riemannian normal coordinates may best be seen 
by using polar coordinates , instead of the Euclidean ones (obtained from the 
map d>). We therefore introduce on the standard polar coordinates 

(r, 

where ip = (ip 1 , . . . , ip 11 ^ 1 ) parametrizes the unit sphere S d_1 (the precise 
formula for ip will be irrelevant for our purposes), and we then obtain polar 
coordinates on T p M via d? again. We express the metric in polar coordinates 
and write g rr instead of t/n, because of the special role of r. We also write 
g rv> instead of gu, £ £ {2, . . . , d}, and g vv as abbreviation for (gki)k,t- 2 ,...,d- 
In particular, in these coordinates at 0 € T p M (this point corresponds to 
p £ M) 

g rr (0) = l, 9rv (0) = 0 (1.4.15) 

by (1.4.11) and since this holds for Euclidean polar coordinates. 

After these preparations, we return to the analysis of the geodesic equa- 
tion (1.4.8). The lines ip = const, are geodesic when parametrized by arc 
length. They are given by x(t) = (t, po), ipo fixed, and from (1.4.8) 

= 0 for all i 

(we have written T) r instead of J^), hence 

g ie (2g ritr - g rr ,e ) = 0, for all i, 


thus 


2 g r i, r — g r r,e = 0, for all £. 


For £ = r, we conclude 
and with (1.4.15) then 
Inserting this in (1.4.16), we get 


9rr,r U, 
g rr = 1. 


(1.4.16) 


(1.4.17) 
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9rtp,r U, 

and then again with (1.4.15) 

g rv = 0. (1.4.18) 

We have shown 

Theorem 1.4.5 For the polar coordinates, obtained by transforming the Eu- 
clidean coordinates o/R d , on which the normal coordinates with centre p are 
based, into polar coordinates, we have 

0 ... 0 

3w( r . P) 

where g V ip{r,<p) is the (d— 1) x (d— 1) matrix of the components of the metric 
w.r.t. angidar variables (< p 1 , . . . ,ip d ~ 1 ) £ S d ~ 1 . □ 

The polar coordinates of Theorem 1.4.5 are often called Riemannian po- 
lar coordinates. The situation is the same as for Euclidean polar coordinates: 
For example in polar coordinates on R , the Euclidean metric is given by 

^ q ^2 ^ • We point out once more that in contrast to Theorem 1.4.4, The- 
orem 1.4.5 holds not only at the origin 0 € T p M, but in the whole chart. 

Corollary 1.4.2 For any p £ M, there exists p > 0 such that Riemannian 
polar coordinates may be introduced on B(p,p) := {<7 £ M : d(p,q) < p). For 
any such p and any q £ dB(p,p), there is precisely one geodesic of shortest 
length (= p) from p to q, and in polar coordinates, this geodesic is given 
by the straight line x(t) = (t, ipo), 0 < t < p, where q is represented by 
the coordinates (p,(po)i‘Po G S rf_1 . Here, “of shortest length” means that the 
curve is the shortest one among all curves in M from p to q. 

Proof. The first claim follows from Corollary 1.4.1 (and its proof) and The- 
orem 1.4.3. For the second claim, let c(t) = 0 < t < T, be an 

arbitrary curve from p to q. c(t ) need not be entirely contained in B{p,p) 
and may leave our coordinate neighborhood. Let 

t.Q := inf{t < T : d(x(t),p) > p}. 

Then to < T , and the curve C| [o,* 0 ] is entirely contained in B(p,p). We shall 
show T(c|[o,t 0 ]) > p with equality only for a straight line in our polar coor- 
dinates. This will then imply the second claim. The proof of this inequality 
goes as follows: 
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L ( c |[o,t 0 ]) = J( 9 ij{c(t))c i c j )idt 
0 
to 

> f (g rr (c(t))rr)idt 
o 

= f\r\dt > frdt 
0 0 
= r(t 0 ) = p 


by (1.4.18) and 

since g vv is positive definite 

by (1.4.17) 

by definition of to, 


and equality holds precisely if g^fxp = 0, in which case f>(t) is constant and 
r > 0 and c(t) thus is a straight line through the origin. □ 


In particular, under the assumptions of Corollary 1.4.2, the Euclidean ball 
d p ( 0) := {y£R d : \y\ < p} C T p M 

is mapped under exp p cliffeomorphically onto the Riemannian ball with the 
same radius, 

B{p,p). 

Corollary 1.4.3 Let M be a compact Riemannian manifold. Then there ex- 
ists p o > 0 with the property that for any p £ M, Riemannian polar coordi- 
nates may be introduced on B(p,po). 

Proof. By Corollary 1.4.2, for any p £ M, there exists p > 0 with those 
properties. By Theorem 1.4.2, exp p is smooth in p. If thus exp p is injective 
and of maximal rank on a closed ball with radius p in T p M, there exists a 
neighborhood U of p such that for all q £ U, exp q is injective and of maximal 
rank on the closed ball with radius p in T q M. 

Since M is compact, it can be covered by finitely many such neighbor- 
hoods and we choose po as the smallest such p. □ 

Corollary 1.4.4 Let M be a compact Riemannian manifold. Then there ex- 
ists po > 0 with the property that, any two points p,q £ M with d(p, q) < po 
can be connected by precisely one geodesic of shortest, length. This geodesic 
depends continuously on p and q. 

Proof, po from Corollary 1.4.3 satisfies the first claim by Corollary 1.4.2. 
Moreover, by the last claim of Corollary 1.4.2, the shortest geodesic from p 
to q £ B(p,po) depends continuously on p. Exchanging the roles of p and q 
yields the continuous dependence on p as well. □ 


We explicitly point out that for any compact Riemannian manifold there is 
always more than one geodesic connection between any two points (This will 
be discussed in Chapter 5.). Only the shortest geodesic is unique, provided p 
and q are sufficiently close. 
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We now want to show the existence of shortest (geodesic) connections 
between any two points of arbitrary distance on a given compact Riemannian 
manifold. 

We recall the notion of homotopy between curves: 


Definition 1.4.5 Two curves 70,71 on a manifold M with common initial 
and end points p and q , i.e. two continuous maps 


70,71 : I = [0, 1] -> M 


with 7o(0) = 71(0) = p, 70(1) = 7i(l) = q, are called homotopic if there 
exists a continuous map 

r-.ixi^M 

with 

T( 0, s) = p, T(l, s) = q for all s £ I 

r(t, 0) = 70(f), r(t, 1) = 71(f) for all t € I 

Two closed curves co,ci in M, i.e. two continuous maps 

c 0 , Ci : S 1 -> M, 


are called homotopic, if there exists a continuous map 

c : S 1 x / -» M 


with 

c(t, 0) = co(t), c(t, 1) = ci(t) for all t G S 1 
(S 1 , as usual, is the unit circle parametrized by [0, 2n).). 

Lemma 1.4.6 The concept of homotopy defines an equivalence relation on 
the set of all curves in M with fixed initial and end points as well as on the 
set of all closed curves in M. 

The proof is elementary. □ 

With the help of this concept, we now want to show the existence of 
geodesics: 

Theorem 1.4.6 Let M be a compact Riemannian manifold, p,q £ M. Then 
there exists a geodesic in every homotopy class of curves from p to q, and this 
geodesic may be chosen as a shortest curve in its homotopy class. Likewise, 
every homotopy class of closed curves in M contains a curve which is shortest 
and geodesic. 

Proof. Since the proof is the same in both cases, we shall only consider the 
case of closed curves. 


As a preparation, we shall first show 
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Lemma 1 . 4.7 Let M be a compact Riemannian manifold, po > 0 as in 
Corollary l.f.f. Let 70,71 : S 1 — > M be curves with 

d( 7 o(t), 7 i(t)) < Po for all t G S' 1 . 

TTien 70 and 71 are homotopic. 

Proof. For any t G S 1 let c*(s) : 1 —> M be the unique shortest geodesic from 
70 (t) to 71 (t) (Corollary 1 . 4 . 4 ), as usual parametrized proportionally to arc 
length. Since c* depends continuously on its end points by Corollary 1 . 4 . 4 , 
hence on t, 

r(t,s) := Ct(s ) 

is continuous and yields the desired homotopy. □ 

Proof of Theorem l.f.6: Let (7 n )„gN be a minimizing sequence for arc 
length in the given homotopy class. Here and in the sequel, all curves are 
parametrized proportionally to arc length. We may assume w.l.o.g. that the 
curves 7„ are piecewise geodesic; namely, for each curve, we may find to = 
0 < ti < t-2 < . . . < t m < t m + 1 = 27 t with the property that 

< Po/2 

(po as in Corollary 1 . 4 . 4 ) for j = 1 , . . . , m + 1 with t m+ 1 := 2n). 

Replacing j n \ [t j _ 1 ,t j ] by the shortest geodesic arc between and 7 

we obtain a curve which is homotopic to and not longer than 7„ (the same 
argument also shows that each homotopy class does contain curves of finite 
length). 

We may thus assume that for any 7„ there exist points po,n> • • • ,Pm,n for 
which d(jpj-i tn ,pj >n ) < Po (p m +i,n ■= Po,n,j = 1 , • • ■ ,m+ 1 ) and for which 
contains the shortest geodesic arc between p : j-\. n and Pj, n - Since the lengths 
of the 7 n are bounded as they constitute a minimizing sequence, we may also 
assume that m is independent of n. After selection of a subsequence, by the 
compactness of M, the points po, n , ■ ■ ■ ,Pm,n converge to points po , . . . , p rn , 
for n — » oo. The segment of 7 n between Pj~i yn and Pj,n then converges to the 
shortest geodesic arc between Pj-i and pj, for example by Corollary 1 . 4 . 4 . 
The union of these geodesic segments yields a curve 7. By Lemma 1 . 4 . 7 , 7 is 
homotopic to the 7„, and 


L{ 7) = lim L(7„), 

n—*oo 

and since the curves 7„ are minimizing sequence for the length in their homo- 
topy class, 7 is a shortest curve in this class. Therefore, 7 has to be geodesic. 
Namely, otherwise, there would exist points p and q on 7 for which one of the 
two segments of 7 between p and q would have length at most po , but would 
not be geodesic. By Corollary 1 . 4 . 4 , 7 could then be shortened by replacing 
this segment by the shortest geodesic arc between p and q. By the argument 
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of Lemma 1.4.7, this does not change the homotopy class, and we obtain a 
contradiction to the minimizing property of 7. 

7 thus is the desired closed geodesic. □ 

Corollary 1.4.5 On any compact Riemannian manifold Mi, any two points 
p , q can be connected by a curve of shortest length , and this curve is geodesic. 

Proof. Minimize over all curves between p and q (and not only over those in 
a fixed homotopy class) as in the proof of Theorem 1.4.6. □ 

We also show 

Theorem 1.4.7 Let M be a compact Riemannian manifold. Then for any 
p £ M , the exponential map exp p is defined on all of T p M , and any geodesic 
may be extended indefinitely in each direction. 

Proof. For v £ T p M , let 

A := {t £ R + : c v is defined on [— t, f]}, 

where c v is, as usual, the geodesic with c„(0) = p,c v { 0) = v. It follows 
from c v ( — t) = C- V (t) that e v may also be defined for negative t, at the 
moment at least for those with sufficiently small absolute value. Theorem 
1.4.2 implies A ^ 0. The compactness of M implies the closedness of A. We 
shall now show openness of A : Let c v be defined on [— f, t]; for example 
c v (t) = w £ T Cv ( t )M. By Theorem 1.4.2 there exists a geodesic "Y w (s) with 
7™(0) = c v (t), 7^(0) = tuft), for s £ [0, e] and £ > 0. Putting c t ,(<+s) = 7 w (s) 
for s £ [0 , er] , we have extended c v to [— t, £ + e]. Analogously, c v may be 
extended in the direction of negative t. This implies openness of A , hence 
A = R + . The claims follow easily. □ 

We now want to address the question whether the results of Theorem 
1.4.7 continue to hold for a more general class of Riemannian manifolds than 
the compact ones. Obviously, they do hold for Euclidean space which is not 
compact, but they do not hold for any proper open subset of Euclidean space, 
essentially since such a set is not complete. It will turn out that completeness 
will be the right condition for extending Theorem 1.4.7. 

Definition 1.4.6 A Riemannian manifold M is geodesically complete if for 
all p £ M, the exponential map exp p is defined on all of T p M , or, in other 
words, if any geodesic c(t) with c(0) = p is defined for all t £ R. 

We can now state the Theorem of Hopf-Rinow. 

Theorem 1.4.8 Let M be a Riemannian manifold. The following statements 
are equivalent: 
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(i) M is complete as a metric space (or equivalently, it is complete as a 
topological space w.r.t. its underlying topology, see Corollary l.f.l). 

(ii) The closed and hounded subsets of M are compact. 

(Hi) There exists p £ M for which exp p is defined on all of T p M. 

(iv) M is geodesically complete, i.e. for every p £ M, exp p is defined on 
all of T p M. 

Furthermore, each of the statements (i) - (iv) implies 

(v) Any two points p,q £ M can he joined by a geodesic of length d(p, q), 
i.e. by a geodesic of shortest length. 

Proof. We shall first prove that if exp p is defined on all of T p M , then any 
q £ M can be connected with p by a shortest geodesic. In particular, this will 
show the implication (iv) => (v). 

For this purpose, let 

r := d(p, q), 

and let p > 0 be given by Corollary 1.4.2, let po £ dB(p,p) be a point 
where the continuous function d(q, •) attains its minimum on the compact set 
dB(p,p). Then p 0 = exp p pV, for some V £ T p M. We consider the geodesic 

c(t) := exp p tV, 

and we want to show that 

c(r ) = q. (1.4.19) 

C| [o,r] w iU then be a shortest geodesic from p to q. 

For this purpose, let 

I :={t£ [0, r] : d(c(t),q) = r-t}. 

(1.4.19) means r £ I, and we shall show I = [0,r] for that purpose. I is 
not empty, as it contains 0 by definition of r, and it is closed for continuity 
reasons. I = [0, r] will therefore follow if we can show openness of I. 

Let to £ I- Let p\ > 0 be the radius of Corollary 1.4.2 corresponding to 
the point c(<o) G M. W.l.o.g. p\ < r — to- Let p\ £ dB(c(t 0 ), pi) be a point 
where the continuous function d{q , •) assumes its minimum on the compact 
set dB(c(to), pi). Then 


d{p,Pi) > d(p, q) - d{q,p 1 ). 

Now for every curve 7 from c(to) to q, there exists some 

7 (t) G dB(c(t 0 ),p{). 


(1.4.20) 


Hence 
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L{ 7) > d(c(t 0 ),j(t)) + d(^{t),q) 

= Pi + d{^{t),q) 

> pi + d(pi,q) because of the minimizing property of p\ . 


Hence also 

d(q,c(t 0 )) > pi + d(p 1 ,q) (1.4.21) 

and by the triangle inequality, we then actually must have equality. Inserting 
(1.4.21) into (1.4.20) and recalling d(q,c(to)) = 1 — to gives 

d(p,pi) >r-(r-t 0 -pi) = t 0 + pi- 

On the other hand, there exists a curve from p to p\ of length to + Pi] namely 
one goes from p to c(to) along c and then takes the geodesic from c(to) to p\ 
of length pi . That curve thus is shortest and therefore has to be geodesic as 
shown in the proof of Theorem 1.4.6. By uniqueness of geodesics with given 
initial values, it has to coincide with c, and then 

Pi = c(f 0 + pi). 

Since we observed that equality has to hold in (1.4.21), we get 
d(q, c(t 0 + pi)) = r - (t 0 + pi), 


hence 

to + Pi G I, 

and openness of I follows, proving our claim. 

It is now easy to complete the proof of Theorem 1.4.8: 

(iv) => (iii) is trivial 

(iii) =t> (ii) Let K C M be closed and bounded. Since bounded, K C 
B{p , r) for some r > 0. By what we have shown in the beginning, any point in 
B(p, r) can be connected with p by a geodesic (of length < r). Hence, B(p , r) 
is the image of the compact ball in T p M of radius r under the continuous 
map exp p . Hence, B(p,r) is compact itself. Since K is assumed to be closed 
and shown to be contained in a compact set, it must be compact itself. 

(ii) => (i) Let (p„) n6 N C M be a Cauchy sequence. It then is bounded, 
and, by (ii), its closure is compact. It therefore contains a convergent subse- 
quence, and being Cauchy, it has to converge itself. This shows completeness 
of M. 

(i) =>• (iv) Let c be a geodesic in M , parametrized by arc length, and 
being defined on a maximal interval I. I then is nonempty, and by Theorem 
1.4.2, it is also open. To show closedness, let (fn)„ e N C I converge to t. 

Since 


d(c(t n ),c(t m ))<|t n tm | 
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as c is parametrized by arc length, c(t n ) is a Cauchy sequence, hence has 
a limit p € M, because we assume M to be complete. Let p > 0 be as in 
Corollary 1.4.2. Then B(p, p) is compact, being the image of the compact ball 
of radius r in T p M under the continuous map exp p . Therefore, the argument 
of Corollary 1.4.3 and Corollary 1.4.4 applies to show that there exists po > 0 
with the property that for any point q € B(p,p) any geodesic starting from 
q can be extended at least up to length po- 

Since c(t n ) converges to p, for all sufficiently large m, n 

d(c(t n ),c(t 

m ))<l* n tm I < Po/2 


and 

d(c(t n ),p),d(c(t m ),p) < po- 

Therefore, the shortest geodesic from c(t n ) to c(t m ) can be defined at least on 
the interval [— po,Po]- This shortest geodesic, however, has to be a subarc of 
c, and c thus can be defined up to the parameter value t n + po, in particular 
for t, showing closedness of I. □ 

Let now M be a differentiable submanifold of the Riemannian manifold 
N. The Riemannian metric of N then induces a Riemannian metric on M, by 
restricting the former one to T p M C T p N for p £ N. Thus, M also becomes 
a Riemannian manifold. 

In particular, S n C R" +1 obtains a Riemannian metric. We want to com- 
pute this metric in the local chart of 1.1, namely 

f{x\...,x n+1 ) = ( X \ ) for x n+1 ^ 1 

■’ v ’ ’ ' \1- x n+1 1 - x n+1 J 

= :(p 1 ,...,p”)GK". 

In the sequel, a Latin index occuring twice in a product has to be summed 
from 1 to n + 1, a Greek one from 1 to n. We compute 

1 = x i x i = y a y a { 1 - x n+1 ) 2 + x n+1 x n+1 

hence 


and then 


For g := f 1 then 

dg> 25 jk 4 yiy k 

dy k 1 + y a y a (1 + y a y a ) 2 

dg n+i _ Ay k 

dy k (1 + y a y a ) 2 ' 


x n +1 = y a y a - 1 

y a ya + l 


X = 


2y i 


1 + y a y° 


(i = 1, . . . , n). 


for j = 1, . . . , n, k = 1, . . . , n 
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Let a tangent vector to S n be represented by w = (w 1 , . . . , w n ) in our chart. 
Then 

(w,w) = dg(w) ■ dg(w), where the point denotes the Euclidean 

scalar product ofR n+1 

= ^ + ^4 W 1 + V a V a ) 2 ^ ~ 16(1 + 


+ 16 y 13 y 13 y' 1 ui^ y s w s + 1 Gy^w^y^w 7 } 




(1 + y a y a ) 2 
Thus, the metric in our chart is given by 


3ij(y)- (1 +|y|2)2<%- 


Definition 1.4.7 A diffeomorphism h : M — ■> N between Riemannian 
manifolds is an isometry if it preserves the Riemannian metric. Thus, for 
p € M,v,w £ T p M, and if (-,-)m and (-,-)n denotes the scalar products in 
T p M and T^^N, resp., we have 

( v,w) M = (dh(v),dh(w)) N . 

A differentiable map h : M — > N is a local isometry if for every p £ M there 
exists a neighborhood U for which h\jj : U — > h(U) is an isometry, and h(U) 
is open in N. 


If ( gij(p )) and ('y a p(h(p)) are the coordinate representations of the metric, 
an isometry has to satisfy 


9ij(p) 


7 <*p(h(p)) 


dh a (p) dhd(p) 
dx i dxJ 


A local isometry thus has the same effect as a coordinate change. Isometries 
leave the lengths of tangent vectors and therefore also the lengths and ener- 
gies of curves invariant. Thus, critical points, i.e. geodesics, are mapped to 
geodesics. 

With this remark, we may easily determine the geodesics of S n . The 
orthogonal group 0(n + l) operates isometrically on R" +1 , and since it maps 
S n into S n , it also operates isometrically on S n . Let now p £ S n ,v G T p S n . 
Let E be the two dimensional plane through the origin of K n+1 , containing v. 
We claim that the geodesic c v through p with tangent vector v is nothing but 
the great circle through p with tangent vector v (parametrized proportionally 
to arc length), i.e. the intersection of S n with E. For this, let S £ 0(n+l) be 
the reflection across that E. Together with c v , Sc v is also a geodesic through 
p with tangent vector v. The uniqueness result of Theorem 1.4.2 implies 
c v = Sc v , and thus the image of c v is the great circle, as claimed. 
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As another example, we consider the torus T 2 introduced in 1.1. We in- 
troduce a metric on T 2 by letting the projection 7r be a local isometry. For 
each chart of the form (U, (tt^) - 1 ), we use the Euclidean metric on 7r _1 (C/). 
Since the translations 

z i— > z + m\Wi + 1712 W 2 (mi, m 2 £ Z) 

are Euclidean isometries, the Euclidean metrics on the different components 
of tt~ 1 (U) (which are obtained from each other by such translations) yield 
the same metric on U. Hence, the Riemannian metric on T 2 is well defined. 

Since 7r is a local isometry, Euclidean geodesics of R 2 are mapped onto 
geodesics of T 2 . The global behavior of geodesics on such a torus is most 
easily studied in the case where T 2 is generated by the two unit vectors 
W\ = (1, 0) and w 2 = (0, 1) : A straight line in R 2 which is parallel to one of 
the coordinate axes then becomes a geodesic on T 2 that closes up after going 
around once. More generally, a straight line with rational slope becomes a 
closed, hence periodic geodesic on T 2 , while the image of one with irrational 
slope lies dense in T 2 . 

Before ending this paragraph, we want to introduce the following impor- 
tant notion: 

Definition 1.4.8 Let M be a Riemannian manifold, p £ M. The injectivity 
radius of p is 

i(p) := sup{p > 0 : exp p is defined on d p (0) C T p M and injective}. 

The injectivity radius of M is 

i(M) := inf i(p ). 

pGM 


For example, the injectivity radius of the sphere S n is n, since the expo- 
nential map of any point p maps the open ball of radius n in T p M injectively 
onto the complement of the antipodal point of p. 

The injectivity radius of the torus just discussed is since here the 
exponential map is injective on the interior of a square with centre 0 £ T p M 
and side length 1. 


Perspectives. As the name suggests, the concept of a Riemannian metric was 
introduced by B. Riemann, in his habilitation address [254]. He also suggested to 
consider more generally metrics obtained by taking metrics on the tangent spaces 
that are not induced by a scalar product. Such metrics were first systematically 
investigated by Finsler and are therefore called Finsler metrics. 

For a general metric space, a geodesic is defined as a curve which realizes the 
shortest distance between any two sufficiently close points lying on it. Those metric 
spaces that satisfy the conclusion of the Hopf-Rinow theorem that any two points 
can be connected by a shortest geodesic are called geodesic length spaces, and 
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they are amenable to geometric constructions as demonstrated by the school of 
Alexandrov. See e.g. [194], [15]. 

A Lorentz metric on a differentiable manifold of dimension d + 1 is given by an 
inner product of signature (l,d) on each tangent space T P M depending smoothly 
on p. A Lorentz manifold is a differentiable manifold with a Lorentz metric. The 
prototype is Minkowski space, namely R d+1 equipped with the inner product 

(x, y) = -x°y° + x 1 y 1 + . . . + x d y d 

for x — (x°, x 1 , . . . , x d ), y = (y° , y 1 ,..., y d ). Lorentz manifolds are the spaces oc- 
curing in general relativity. Let us briefly discuss some concepts. Tangent vectors 
V with negative, positive, vanishing ||V|| 2 = (V, V) are called time-like, space-like, 
and light-like, resp. Length and energy of a curve may be defined formally as in the 
Riemannian case, and we again obtain geodesic equations. Geodesics whose tangent 
vectors all have norm zero are called null geodesics. They describe the paths of light 
rays. (Note that in our above description of the Minkowski metric, the conventions 
have been chosen so that the speed of light is 1.) Submanifolds of Lorentz manifolds 
whose tangent vectors are all space-like are ordinary Riemannian manifolds w.r.t. 
the induced metric. For treatments of Lorentzian geometry, an introduction is [209]. 
Deeper aspects are treated in Hawking and Ellisfl 13] . 

J. Nash proved that every Riemannian manifold M can be isometrically em- 
bedded into some Euclidean space R fc . For the proof of this result, he developed an 
implicit function theorem in Frechet spaces and an iteration technique that have 
found other important applications. A simpler proof was found by Gunther[107]. 

Although on a conceptual level, Nash’s theorem reduces the study of Rieman- 
nian manifolds to the study of submanifolds of Euclidean spaces, in practice the 
intrinsic point of view has proved to be preferable (see Perspectives on 1.3). 

In our presentation, we only consider finite dimensional Riemannian manifolds. 
It is also possible, and often very useful, to introduce infinite dimensional Rieman- 
nian manifolds. Those are locally modeled on Hilbert spaces instead of Euclidean 
ones. The lack of local compactness leads to certain technical complications, but 
most ideas and constructions of Riemannian geometry pertain to the infinite di- 
mensional case. Such infinite dimensional manifolds arise for example naturally as 
certain spaces of curves on finite dimensional Riemannian manifolds. A thorough 
treatment is given in [159]. 


1.5 Vector Bundles 

Definition 1.5.1 A (differentiable) vector bundle of rank n consists of a 
total space E , a base M, and a projection tt : E — > M, where E and M 
are differentiable manifolds, n is differentiable, each “fiber” E x := w~ 1 (x) for 
x £ M, carries the structure of an n-dimensional (real) vector space, and the 
following local triviality requirement is satisfied: For each x £ M, there exist 
a neighborhood U and a diffeomorphism 

<p:Tr _1 (l7)->l7xR n 
with the property that for every y £ U 

Vy '■= V\ e v ■ E y -► {y} x R” 
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is a vector space isomorphism, i.e. a bijective linear map. Such a pair (ip, U) 
is called a bundle chart. 


In the sequel, we shall omit the word “differentiable” for a vector bundle. 
Often, a vector bundle will simply be denoted by its total space. 

It is important to point out that a vector bundle is by definition locally, 
but not necessarily globally a product of base and fiber. A vector bundle 
which is isomorphic to M x R" (n = rank) is called trivial. 

A vector bundle may be considered as a family of vector spaces (all iso- 
morphic to a fixed model R n ) parametrized (in a locally trivial manner) by 
a manifold. 

Let now ( E , n, M) be a vector bundle of rank n, (U a ) a& A a covering of M 
by open sets over which the bundle is trivial, and p a : n~ 1 (U oc ) —> U a x R n 
be the corresponding local trivializations. For nonempty U a fl Up, we obtain 
transition maps 

ipp a :U a nUp-> G1 (n,R) 


by 


Pp ° (/? Q 1 (a;,u) = {x, ppa(x)v) fora: £ U a C\ Up, v € R", (1.5.1) 


where Gl(n, R) is the general linear group of bijective linear self maps of R". 
The transition maps express the transformation behavior of a vector in the 
fiber under a change of local trivialization. 

The transition maps satisfy 


<Paa(x) = 

id*** 

for 

X G U oc 

(1.5.2) 

<Pafi(x)ipp a (x) = 

id]Rn 

for 

X G U(% 0 U / 3 

(1.5.3) 

ip ai (x)ip 1 p(x)ipp ct (x) = 

idRn 

for 

x g u at n up n u*y . 

(1.5.4) 


These properties are direct consequences of (1.5.1). 

A vector bundle can be reconstructed from its transition maps. 


Theorem 1.5.1 

E = [] U a x R n / ~ , 

aeA 

where II denotes disjoint union, and the equivalence relation ~ is defined by 
( x , v ) ~ ( y , w) : 4=> x = y and w = ipp a (x)v (x £ U a , y £ Up, v, w £ R ra ) 


The proof of this fact is a straightforward verification of the properties 
required in Definition 1.5.1. A reader who does not want to carry this out 
him/herself may consult D. Husemoller, Fibre bundles, Springer, GTM 20, 


1975. 


□ 
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Definition 1.5.2 Let G be a subgroup of Gl(n, R), for example O(n) or 
SO(n), the orthogonal or special orthogonal group. We say that a vector 
bundle has the structure group G if there exists an atlas of bundle charts for 
which all transition maps have their values in G. 

Definition 1.5.3 Let (E,n,M) be a vector bundle. A section of E is a 
differentiable map s : M —> E with n o s = id m- The space of sections of E 
is denoted by T(E). 

We have already seen an example of a vector bundle above, namely the 
tangent bundle TM of a differentiable manifold M. 

Definition 1.5.4 A section of the tangent bundle TM of M is called a vector 
field on M. 

Let now / : M — > N be a differentiable map, (E, n, N ) a vector bundle 
over N. We want to pull back the bundle via /, i.e. construct a bundle f*E, 
for which the fiber over x € M is Ef( x ), the fiber over the image of x. 

Definition 1.5.5 The pulled back bundle f*E is the bundle over M with 
bundle charts {jp o /, /~ 1 ({ 7 )), where (p, U ) are bundle charts of E. 

We now want to extend some algebraic concepts and constructions from 
vector spaces to vector bundles by performing them fiberwise. For example: 

Definition 1.5.6 Let (Ei,ni, M ) and (E 2 , 7 r 2 , M) be vector bundles over M. 
Let the differentiable map / : E\ —> E 2 be fiber preserving, i.e. 

7T 2 o / = 7Tl, 

and let the fiber maps f x : E\ tX — > E 2>x be linear, i.e. vector space homomor- 
phisms. Then / is called a bundle homomorphism. 

Definition 1.5.7 Let (E,tt,M) be a vector bundle of rank n. Let E' C E, 
and suppose that for any x G M there exists a bundle chart (p, U ) with 
x £ U and 

pin-^U) nE') = Ux M m (c U x R ", m < n). 

The resulting vector bundle (E', 7T|£/, M) is called subbundle of E of rank m. 

Let us discuss an example: S 1 = {x £ M 2 : |x| 2 = 1} is a submanifold of 
]R 2 . If we restrict the tangent bundle TR 2 of R 2 to S 1 , we obtain a bundle 
E over S 1 that is isomorphic to S 1 xl 2 . The tangent bundle of S 1 has fiber 
T x S l = {y € R 2 : x ■ y = 0} C R 2 (where the dot • denotes the Euclidean 
scalar product). TS 1 is a subbundle of TR 2 ^ 1 ; the reader is invited to write 
down explicit bundle charts. 
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Definition 1.5.8 Let M) and (E 2 ,ir 2 , M) be vector bundles over 

M. The Cartesian product of E\ and E 2 is the vector bundle over M with 
hber E\ tX x E 2 , x and bundle charts (tp a x ipp, U a fl Vp), where (1 p a , U a ) and 
(tppiVfi) are bundle charts for E\ and E 2 resp., and 

(ifia X ipp)(x, (v,w)) := (<Pa(x,v),ipp(x,w)) (v G E 1<x ,w G E 2 ,x)- 


Thus, the product bundle is simply the bundle with hber over x G M 
being the product of the fibers of E\ and E 2 over x. By this pattern, all 
constructions for vector spaces can be extended to vector bundles. Of partic- 
ular importance for us will be dual space, exterior and tensor product. Let 
us briefly recall the definition of the latter: 

Let V and W be vector spaces (as always over R) of dimension m and 
n, resp., and let (ei, . . . , e m ) and (/ 1 , . . . , /„) be bases. Then V ® W is the 
vector space of dimension mn spanned by the basis (e* <S> fj) i= 1 , . . . , m . There 

j = l,...,n 

exists a canonical bilinear map 

L :V xW -*V OFF 

mapping (a*ej, V fj) onto a*6 J ej <S) fj 

One may then also define the tensor product of more than two vector 
spaces in an associative manner. 


Definition 1.5.9 Let M be a differentiable manifold, x G M. The vector 
space dual to the tangent space T X M to R is called the cotangent space of 
M at the point x and denoted by T*M. The vector bundle over M whose 
fibers are the cotangent spaces of M is called the cotangent bundle of M and 
denoted by T*M. Elements of T*M are called cotangent vectors, sections of 
T*M are 1 -forms. 


We now want to study the transformation behavior of cotangent vectors. 
Let (ei)i = i,...,d be a basis of T X M and the dual basis of T*M, i.e. 


w J (ei) = Sj 


1 for i = j 
0 for i ^ j 


Moreover, let v — v l ei G T X M , 77 = r\jU)i G TfM. We have rj(v) = r]iv' 1 . Let 
the bases (e,;) and be given by local coordinates, i.e. 


ei = 


d_ 
dx i ’ 


w 7 = dxE 


Let now / be a coordinate change, v is transformed to 


f*(v) 



d 

W 
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g then has to be transformed to 


because in this case 


f)r‘i 

rw --=rh W 


df 


= mv 1 = g(v). 

Thus a tangent vector transforms with the functional matrix of the coordinate 
change whereas a cotangent vector transforms with the transposed inverse 
of this matrix. This different transformation behavior is expressed by the 
following definition: 


Definition 1.5.10 A p times contravariant and q times covariant tensor on 
a differentiable manifold M is a section of 

TM ® ...®TM <g) T*M (8> ... <8 T*M . 

' V ' ' V ' 

p times q times 


Actually, one should speak of a tensor field, because “tensor” often also 
means an element of the corresponding fibers, in the same manner, as a 
(tangent) vector is an element of T X M and a vector field a section of TM. 

If / is a coordinate change, a p times contravariant and q times covariant 
tensor is transformed p times by the matrix ( df ) and q times by the matrix 

(df- 1 )*. 


Lemma 1.5.1 A Riemannian metric on a differentiable manifold M is a two 
times covariant (and symmetric and positive definite) tensor on M. 

Proof. From the formula (1.4.3) for the transformation behavior of a Rieman- 
nian metric. □ 

A Riemannian metric thus is a section of T*M ® T*M. We consequently 
write the metric in local coordinates as 

gij(x)dx l (g> dx° . 

Theorem 1.5.2 The tangent bundle of a Riemannian manifold M of dimen- 
sion d has structure group 0(d). 

Proof. Let (/, U) be a bundle chart for TM, 

/ : tt-\U) -> U x R d . 
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Let ei ,ed be the canonical basis vectors of R d , and let V\,...,Vd be the 
sections of 7 t - 1 (17) with fivi) = e*, i = 1, . . . , d. Applying the Gram-Schmidt 
orthogonalization procedure to V\(x ), . . . , Vd(x) for each x € U we obtain 
sections w \ , . . . , Wd of 7r -1 (Z7) for which wi(x), . . . , Wd(x) are an orthonormal 
basis w.r.t. the Riemannian metric on T X M, for each x £ U. By 

/' : n-\U) -► U x R d 

A l Wi(x) (x, X 1 , . . . , X d ) 

we then get a bundle chart which maps the basis u>i(:r), . . . ,Wd(x ), i.e. an 
orthonormal basis w.r.t. the Riemannian metric, for each x £ U onto an 
Euclidean orthonormal basis of K d . We apply this orthonormalization process 
for each bundle chart and obtain a new bundle atlas whose transition maps 
always map an Euclidean orthonormal basis of R d into another such basis, 
and are hence in 0(d). □ 

We want to point out, however, that in general there do not exist local 
coordinates for which Wi(x) = for i = 1, . . . , d. 

Corollary 1.5.1 The tangent bundle of an oriented Riemannian manifold of 
dimension d has structure group SO (d). 

Proof. The orientation allows to select an atlas for which all transition maps 
have positive functional determinant. From this, one sees that we also may 
obtain transition functions for the tangent bundle with positive determinant. 
The orthonormalization process of Theorem 1.5.2 preserves the positivity of 
the determinant, and thus, in the oriented case, we obtain a new bundle atlas 
with transition maps in SO(d). □ 

Definition 1.5.11 Let be a a vector bundle. A bundle metric is 

given by a family of scalar products on the fibers E Xl depending smoothly 
on i € M. 

In the same manner as Theorem 1.5.2, one shows 

Theorem 1.5.3 Each vector bundle (E,ir,M) of rank n with a bundle 
metric has structure group 0(n). In particular, there exist bundle charts 
(fi U), f : 7 r _1 (t/) — » U x M n , for which for all x £ U, / -1 (;r, (ei, . . . , e„)) is 
an orthonormal basis of E x (ei, . . . ,e„ is an orthonormal basis o/E n ). □ 

Definition 1.5.12 The bundle charts of Theorem 1.5.3 are called metric. 

In the same manner as Theorem 1.4.1, one shows 

Theorem 1.5.4 Each vector bundle can be equipped with a bundle metric. 

□ 



1.5 Vector Bundles 


39 


It will be more important for us, however, that a Riemannian metric au- 
tomatically induces bundle metrics on all tensor bundles over M. The metric 
of the cotangent bundle is given in local coordinates by 


(u>, if) = g l3 u>iiy ior lu = Uidx 1 , = r]idx l . (1.5.5) 

(We recall that (g 13 ) is the matrix inverse to (gij))- 

Namely, this expression has the correct transformation behavior under 
coordinate changes: If w i— > x(w) is a coordinate change, we get 

dx^ 

uiidx 1 = u>i~ — dw a =: Cb a dw a . 
ow a 


while g lJ is transformed into 


h a 0 


dw a du>P 

Q J : 

dx i dxi 


and 

h a/3 oj a rjp = g^UiTjj. 

Moreover, we get 


||w(a;)|| = sup{w(a:) (v) : v G T X M , ||tij| = 1}. 

A Riemannian metric also induces an identification between TM and T* M : 

■ d 

v = v 1 -g—. corresponds to u) = u>jdx 3 

with Uj = gijV % 
or r' - g'-'xj . 

(1.5.5) may also be justified as follows: 

Under this identification, to v G T X M there corresponds a 1-form u> G 
T*M via 

u>(w) := (■ v,w ) for all w 
and (1.5.5) means then that 

IMMMI- 


For example, on TM 0 TM, the metric is given by 


{v 0 w, £ <8 t ?) = gijV l £ J guw k rf (1.5.6) 

(v = v l etc. in local coordinates). 


Definition 1.5.13 A local orthonormal basis of T X M of the type obtained 
in Theorem 1.5.3 is called an (orthonormal) frame field. 
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We put 

A P {T*M) := T*M A . . . AT*M (exterior product). 

V ' 

p times 

On A P [T*M ), we have two important operations: First, the exterior product 
by r, £ T*M = A\T*M) : 

A P (T*M) -> A P+1 (T*M) 

lo i — ► e(rj)oj := ij A lo. 

Second, the interior product or contraction by an element v £ T x M : 

A P (T*M) -► A P ^(T*M) 

U I — > i(v)u 

with 

v p - 1 ) := u>(v, Vi,..., u p _i) 

for v, vi , . . . , t>p_i £ T X M. 

In fact, such constructions may be carried out with any vector space W and 
its dual W* in place of T*M and T X M. This will be relevant in §1.8. 

The vector bundle over M with fiber A P (T*M) over x is then denoted by 
A p (M). 

Definition 1.5.14 The space of sections of A P (M) is denoted by 17 P (M), 
i.e. Q P (M) = r(A p (M)). Elements of f2 p (M) are called (exterior) p-forms. 

A p-form thus is a sum of terms of the form 

uj ( x ) = rj(x)dx n A ... A dx Zp 

where r](x) is a smooth function and (x 1 , . . . , x d ) are local coordinates. 

Definition 1.5.15 The exterior derivative d : fi p {M) — > Q p+l (M) (p = 
0 , ,d = dim M) is defined through the formula 

d(r](x)dx 11 A ... A dx lp ) = ^j^-dx^ A dx n A ... A dx lp 

ox 0 

and extended by linearity to all of f2 p (M). 

Lemma 1.5.2 If lo £ I2 p (M),d £ then d{u Ad) = dcoA , d+ (— l) p wA 

dd. 


Proof. This easily follows from the formula co A d = (— 1 ) pq d A to and the 
definition of d. □ 
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Let / : M — > IV be a differentiable map, 

u{z) = r](z)dz il A ... A dz ip G H P {N). 

We then define 

d P 1 d P p 

/*(w)(; r) = r](f(x))— — dx ai A ... A — dx ap . 

j \ JJg xai g x a p 

This obviously is the correct transformation formula for p- forms. 

Lemma 1.5.3 

Proof. This easily follows from the transformation invariance 

dr M dz j = dr MMl d IL dx <* = W)) ,y 

dzJ dzi dx a dx a 


□ 


Corollary 1.5.2 d is independent of the choice of coordinates. 

Proof. Apply Lemma 1.5.3 to a coordinate transformation /. □ 

Theorem 1.5.5 d o d = 0. 

Proof. By linearity of d, it suffices to check the asserted identity on forms of 
the type 

ui{x) = f(x)dx H A ... A dx lp . 


Now 

d o d(uj(x)) 

= d(^-rdx :i A dx 11 A ... A dx lp ) 
y dx> ’ 


= d. / dx k A dx^ A dx n A ... A dx lp 
dxidx k 

= 0 , 

o2 r o2 r 

since a k = a kl , ( f is assumed to be smooth) and 

OXJ OX K OX K OX3 \J / 

dx J A = — dx k A da ;- 7 . □ 


Let now M be a differentiable submanifold of the Riemannian manifold 
N: dim M = rn. dim N = n. We saw already that M then also carries a 
Riemannian metric. For x G M, we define 

T^M C T X N 
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by 

T^M := {v G T X N : Vro G T X M : (v,w) = 0}, 

where (., .), as usual, is the scalar product given by the Riemannian metric. 

The spaces T^rM are the fibers of a vector bundle T ± M over M, and 
TM and T ± M are both subbundles of TN\ M , the restriction of TN to M 
(in a more complicated manner: TN\m = i*TN , where i : M —* N is the 
differentiable embedding of M as a submanifold of N). In order to see this, 
one may choose the first m basis vectors v\, ... , v m of TN\m in the orthonor- 
malization procedure of the proof of Theorem 1.5.2 in such a manner that 
they locally span TM. 

Then TM is also locally spanned by wi, . . . , w m (notation as in the proof 
of Theorem 1.5.2), and the remaining basis vectors then span T ± M, and we 
have 

(wi, w a ) = 0 for i = 1 , . .., to, a = to + 1 , . . . , n. 

Thus, T ± M is the orthogonal complement of TM in TN\m- 

Definition 1.5.16 T ± M is called the normal bundle of M in N. 

For our example of the submanifold S' 1 of K 2 , T^-S 1 is the subbundle of 
TTRj^i, the restriction of TM 2 to S 1 , with fiber T^rS 1 — {Acc : A G R} C R 2 . 

We conclude this section with a consideration of the complex case - again, 
we remind the reader that is needed only in particular places, like 5.2. 

Definition 1.5.17 A vector bundle E over a differentiable manifold M is 
called a complex vector bundle if each fiber E z — ■n^ 1 {z) is a complex vector 
space, i.e. , isomorphic to zxC k , and if that complex structure varies smoothly, 
that is, the local trivializations are of the form 

ip : 7r _1 (C0 -> U x C fc . 


We thus have transition maps 


(pp a : U a n Up -» G1 (k, C) . 


Here, in contrast to the Definition 1.1.5 of a complex manifold, we neither 
require that the base M be complex nor that these transition maps be holo- 
morphic. If, however, these conditions are satisfied, that is, M is a complex 
manifold and the transition maps are ho lomorphic, then we have a holomor- 
phic vector bundle . 


On a complex manifold M, in local holomorphic coordinates, we have the 
1-forms 

dz i := dx J + idy\ dz k := dx J — idy j 
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(recall (1.1.2)). We can then decompose the space Q k of /c-forms into sub- 
spaces fl v ' q with p + q = k. Namely, f2 p,q is locally spanned by forms of the 

type , 

u(z) = rj{z)dz 11 A ... A dz lp A dz n A ... A dz 3q . 


Thus 


Q k (M)= (2 p ' q {M). 

P+g=fc 


We can then let the differential operators 


(1.5.7) 


d =-( ( dx 3 + idy 3 ) and d = - ( + * 77 —^ ( dx 3 — idy J ) 
2 \dxi dyi ) y y ’ 2 \dx 1 dyi J y y ’ 

(1.5.8) 

operate on such a form by 

dco = dz i A dz* 1 A ... A dz ip A dz' jl A ... A dz ]q (1.5.9) 
dz l y ’ 

and 

Bio = dz ] A dz h A ... A dz ip A dz ]l A ... A dz ]q . (1.5.10) 

dz q 

The following important relations link them with the exterior derivative 
d: 


Lemma 1.5.4 The exterior derivative d satisfies 

d = d + 8. (1.5.11) 

Moreover, 

dd = 0,88 = 0, (1.5.12) 

d8=-8d. (1.5.13) 


Proof. We have 

1 ( 8 


8 


d + d =0 hw ^ ( dxJ + id V J ) + o aTT + ( dx 3 ~ id V 3 ) 


2 \ dxi 8yi 

d d 

dxi dyi y 


8 


8 


2 \ dxi Qyi 


Therefore, 

0 = d 2 = (d + 8)(d + 8) = d 2 + d8 + 8d + 8 2 
and decomposing this into types yields (1.5.12) and (1.5.13). One may verify 
these relations also by direct computation, e.g. 

8 2 . , e 8 2 


dd = 


dzidz k 


dz J Adz = — 


dz k 8zi 


dz k A dz 3 = -dd. 


□ 
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1.6 Integral Curves of Vector Fields. Lie Algebras 

Let M be a differentiable manifold, X a vector field on M, i.e. a (smooth) 
section of the tangent bundle TM. X then defines a first order differential 
equation (or, more precisely, if dimM > 1, a system of differential equations): 

c = X(c). (1.6.1) 

This means the following: For each p £ M, one wants to find an open interval 
I = I p around 0 € R. and a solution of the following differential equation for 
c: I —> M 


-(t) = X(c(t)) forte/ 

c (0) = p ■ (1.6.2) 

One checks in local coordinates that this is indeed a system of differential 
equations: in such coordinates, let c(t ) be given by 

(c 1 (t), . . . , c d (t)) (d = dimM) 

and let X be represented by 

X i± 

dx r 

Then (1.6.2) becomes 

— (t) = X z (c(t)) for i = 1, . . . , d. (1.6.3) 

Since (1.6.3) has a unique solution for given initial value c(0) = p by the 
Picard-Lindelof theorem, we obtain 


Lemma 1.6.1 For each p £ M, there exist an open interval I p C R. with 
0 £ Ip and a smooth curve 

c p : Ip —> M 


with 


d ^{t)=X{cp{i)) 

Cp(0) =p. □ 


Since the solution also depends smoothly on the initial point p by the 
theory of ODE, we furthermore obtain 

Lemma 1.6.2 For each p £ M, there exist an open neighborhood U of p and 
an open interval I with 0 £ I, with the property that for all q £ U, the curve 
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c q (c q (t) = X(c q (t)),Cq( 0) = q) is defined on I. The map ( t,q ) i— > c q (t) from 
I x U to M is smooth. □ 

Definition 1.6.1 The map (t, q) i— > c q (t) is called the local flow of the vector 
field X. The curve c q is called the integral curve of X through q. 

For fixed q , one thus seeks a curve through q whose tangent vector at each 
point coincides with the value of X at this point, i.e. a curve which is always 
tangent to the vector field X. Now, however, we want to fix t and vary q ; we 
put 

Mo) '■= c g (t). 


Theorem 1.6.1 We have 

ip t °Ts(q) = Tt+s(q), if s,t,t+s£l q , (1.6.4) 

and if ipt Is defined on U C M, it maps U diffeomorphically onto its image. 

Proof. We have 

c q (t + s) = X(c q (t. + s)), 

hence 

c q (t. + s) = c Cq(s) {t). 

Starting from q , at time s one reaches the point c q (s ), and if one proceeds a 
time t further, one reaches c q (t + s). One therefore reaches the same point if 
one walks from q on the integral curve for a time t + s, or if one walks a time 
t, from c q (s). This shows (1.6.4). Inserting t = —s into (1.6.4) for s £ I qi we 
obtain 

p- s °Ts(q) = To (q) = q- 

Thus, the map <p~s is the inverse of <Psi and the diffeomorphism property 
follows. □ 

Corollary 1.6.1 Each point in M is contained in precisely one integral curve 
for (1.6.1). 

Proof. Let p £ M. Then p = c p (0), and so, it is trivially contained in an 
integral curve. Assume now that p = c q (t). Then, by Theorem 1.6.1, q = 
c p (—t). Thus, any point whose flow line passes through p is contained in the 
same flow line, namely the one starting at p. Therefore, there is precisely one 
flow line going through p. □ 

We point out, however, that flow lines can reduce to single points; this 
happens for those points for which X(p) = 0. Also, flow lines in general are not 
closed even if the flow exists for all f el. Namely, the points lim^-i-oo c p (t ) 
(assuming that these limits exist) need not be contained in the flow line 
through p. 
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Definition 1.6.2 A family (ip t ) tG j (/ open interval with 0 £ I) of diffeo- 
morphisms from M to M satisfying (1.6.4) is called a local 1-parameter group 
of diffeomorphisms. 

In general, a local 1-parameter group need not be extendable to a group, 
since the maximal interval of definition I q of c q need not be all of R. This is 
already seen by easy examples, e.g. M = R, X(t) = t 2 -^, i.e. c(t) = c 2 (t) 
as differential equation. 

However 


Theorem 1.6.2 Let X be a vector field on M with compact support. Then 
the corresponding flow is defined for all q € M and all t £ R, and the local 
1-parameter group becomes a group of diffeomorphisms. 


Proof. By Lemma 1.6.2, for every p £ M there exist a neighborhood U and 
e > 0 such that for all q € U, the curve c q is defined on (— £,e). Let now 
suppX C K , K compact. K can then be covered by finitely many such 
neighborhoods, and we choose £o as the smallest such e. 

Since for q £ K X(q) = 0, 


<Pt(q) = c q {t) 


is defined on (— £o,£o) x M, and for |s|, |£| < £ q / 2 , we have the semigroup 
property (1.6.4). 

Since the interval of existence (— £o,£o) may be chosen uniformly for all 
q , one may iteratively extend the flow to all of R. For this purpose, we write 
t £ R as £ 

t = m-^ + p with m £ Z, 0 < p < sq/2 

and put 

ft ■= (fe 0 /2) m O fp 


( ( / ? t)teR then is the desired 1-parameter group. 


□ 


Corollary 1.6.2 On a compact differentiable manifold, any vector field gen- 
erates a 1-parameter group of diffeomorphisms. □ 


The preceding is a geometric interpretation of systems of first order ODE 
on manifolds. However, also higher order systems of ODE may be reduced 
to first order systems by introducing additional independent variables. As an 
example, we want to study the system for geodesics, i.e. in local coordinates 

x*(f) + r l j k {x{t))x^ (t)x k (t) = 0, i = l,...,d. (1.6.5) 

We want to transform this second order system into a first order system on 
the cotangent bundle T*M. As usual, we locally trivialize T*M by a chart 

T*M\ V ~ U x R d 
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with coordinates (x 1 , . . . , x d ,pi, . . . ,pd )■ 

We also put 

H(x,p) = ^ g l \x)piPj (, g lJ (x)g jk (x ) = 5\) ■ (1.6.6) 

(The transformation behavior of g lJ and p k implies that H does not depend 
on the choice of coordinates.) 

Theorem 1.6.3 (1.6.5) is equivalent to the following system on T*M : 

■i dH ... 

1 =W, =S (x)n 

arr I 

i>i = ~dx i= ~2 gjk ^ x ^ Pk ^ jk •* := ■ (L6J) 


Proof. From the first equation 

x l = g lJ (x)pj + g lJ tk (x)x k pj 

= g l3 Vj + g lJ ,ki k gjex e 

and with the second equation then 

x l = -^g %3 g lk jPtPk + 9 lJ ,kgjex k x e 

= +^g lJ 9 tm 9mn,jg nk 9irX r g ks x s 
- g im gmn,kg n:> gjex k x e 

(from g lJ g jk = S k it follows that g lJ = -~g lm g m n,eg n:> ) 
= ^g lJ 9mn,jX m x n - g im g m n,kX k x n 

“ 2 9 d (.9 mn,j 9jn,m 9jm,n)X X 

Since gmn,kX X = — 9mn,kX X T — 9mk.nX X 

and after renumbering some indices 

— _ p 1 r m -r n 
1 mu' 1, •*' ■ 


□ 


Definition 1.6.3 The flow determined by (1.6.7) is called the cogeodesic 
flow. The geodesic flow on TM is obtained from the cogeodesic flow by the 
first equation of (1.6.7). 

Thus, the geodesic lines are the projections of the integral curves of the 
geodesic flow onto M. 
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The reason for considering the cogeodesic instead of the geodesic flow is 
that the former is a Hamiltonian flow for the Hamiltonian H from (1.6.6). 
We remark that by (1.6.7), we have along the integral curves 

= H x ii l + H Pi pi = -piX 1 + x l pi = 0. 
clt 

Thus, the cogeodesic flow maps the set E x := {(x,p) £ T*M : H(x,p) = A} 
onto itself for every A > 0. If M is compact, so are all E\. Hence, by Corollary 
1.6.2, the geodesic flow is defined on all of E\, for every A. Since M = E\, 

Theorem 1.6.3 yields a new proof of Theorem 1.4.7. 

If ip : M — » TV is a diffeomorphism between differentiable manifolds, and 
if X is a vector field on M, we define a vector field 

Y = ^X 

on N by 

Y(p) = d^Xty-'b))). (1.6.8) 

Then 

Lemma 1.6.3 For any differentiable function f : N — > K 

(V>*X)(/)Cp) = X(f o V0W’“ 1 (p)) . (1.6.9) 


Proof. 


(M)(/)W = (#oI)(/)(p) 

= {df odifoX)(tp^ 1 (p)) 

= X(fo^-\p)). □ 


If p : N — > P is another diffeomorphism, obviously 

(p o %/j)*X = (X)) (1.6.10) 

Lemma 1.6.4 Let X be a vector field on M, ip : M —> N a diffeomorphism. 
If the local 1-parameter group generated by X is given by (fit, the local group 
generated by ip*X is 

ip o ip t o ip^ 1 . 


Proof. ipoip t oip 1 is a local 1-parameter group, and therefore, by uniqueness 
of solutions of ODE, it suffices to show the claim near t = 0. Now 
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^(ip o tp t o ip 1 (p))| t=0 = dip(^tp t °ip 1 (p)|t=o) 

(where dip is evaluated at (po o ip^ 1 (p)) = ip ~ 1 (p)) 

= dipX(ip^ 1 (p)) — ip*X(p). □ 


Definition 1.6.4 For vector fields X, Y on M, the Lie bracket 

[X,Y] 

is defined as the vector field 

djrd__ j dx ^d_ A 

dxi dx i dx-i dx i y dx v dx ih 

We say that the vector fields X and Y commute, if 

[x,y] = o. 


Lemma 1.6.5 [X, Y] is linear (over R) in X and Y. For a differentiable 
function f : M — > R, we have [X,Y]f = X(Y(f)) — Y(X(f)). Furthermore, 
the Jacobi identity holds: 

[[X, Y],Z\ + [[y, Z\,X] + [[Z, X] ,Y]=0 
for any three vector fields X, Y, Z. 

Proof. In local coordinates with X = X 1 ,Y = Y l -JX , we have 
ayi a c a vi a f 

[x ^ f yJ i>.r.,J' x - Y ( L6 - n ) 

and this is linear in f,X,Y. This implies the first two claims. The Jacobi 
identity follows by direct computation. □ 

Definition 1.6.5 A Lie algebra (over R) is a real vector space V equipped 

with a bilinear map [•,■]: V x V — > V, the Lie bracket, satisfying: 

(i) [X, X] = 0 for all X G V. 

(ii) [X,[Y,Z]] + [Y,[Z,X}\ + [Z,[X,Y}] = 0 for all X,Y,Z G V. 

Corollary 1.6.3 The space of vector fields on M, equipped with the Lie 
bracket, is a Lie algebra. □ 

Lemma 1.6.6 Let ip : M — > N be a diffeomorphism, X,Y vector fields on 
M. Then 

[ip*X,ip*Y] = ip*[X,Y]. (1.6.12) 

Thus, ip * induces a Lie algebra isomorphism. 
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Proof. Directly from Lemma 1.6.3. □ 

We now want to investigate how one might differentiate tensor fields. A 
function / : M —> R, if smooth, may simply be differentiated at a point x by 
comparing its values at x with those at neighbouring points. For a tensor field 
S, this is not possible any more, because the values of S at different points 
lie in different spaces, and it is not clear how to compare elements of different 
fibers. For this purpose, however, one might use a map F of one fiber onto 
another one, and an element v of the first fiber may then be compared with 
an element w of the second fiber by comparing F(v) and w. One possibility 
to obtain such a map at least between neighbouring fibers (which is sufficient 
for purposes of differentiation) is to use a local 1-parameter group 
of diffeomorphisms. If for example X = X 1 -^ is a vector field, we consider 
(tl}-t)*X(if) t (x)). This yields a curve X t in T X M (for t € /), and such a curve 
may be differentiated. In particular, 

d dib k d 

(V’- t )* 7 ^(V’t(a:)) = (evaluated at (a;)). (1.6.13) 

(In general, one has for ip : M — > N <p*-£~r = but in case M = N 

and x and ip(x) are contained in the same coordinate neighborhood, of course 
d _ d \ 
dtp k dx k ' ’ 

If to — Lu\ fix 1 is a 1-form, we may simply consider 

W’DMW = ^z{fpt{x))^j,dx k , (1.6.14) 

which is a curve in T* M. 

In general for a smooth map ip : M —> N and a 1-form u> = LOidz 1 on N, 

c) 

P*0J ■- Ui(<p(x))-^dx k \ (1.6.15) 

note that <p need not be a diffeomorphism here. 

Analogously, for a section h = hijdz 1 (g> dz^ , of T*N ® T*A^ 

Bz i Bzi 

(ip*)h = h i j— j ^-^dx k ®dx t . (1.6.16) 

Finally, for a function / : N — > R of course 

P*f = f 0( P- (1.6.17) 

If ip : M — » N is a diffeomorphism, and Y is a vector field on N, we put 

p*Y := (^" 1 )*F. (1.6.18) 

in order to unify our notation. 

ip* is then defined analogously for other contravariant tensors. 
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In particular, for a vector field X on M and a local group (if t )tei as 
above: 

Wt)X = (1.6.19) 

Definition 1.6.6 Let X be a vector field with a local 1-parameter group 
(V’t)te/ °f local diffeomorphisms, S a tensor field on M. The Lie derivative 
of S in the direction X is defined as 

L X S j f {r t S)\t= 0 . 


Theorem 1.6.4 

i) Let f : M — > R. be a (differentiable) function. Then 

L x (f) = df(X) = X(f). 


ii) Let Y be a vector field on M. Then 

L x Y = [X,Y], 


in) 


Let to = ujjdxl be a 1-form, on M. Then for X = 


r fduj , dX i 
LxU - Vd^ x + d^ L 


dx J . 


Proof. 

i) L x (f) = |#/, t=0 = if o ipt lt=0 = &X* = X(f) (cf. (1.6.17)). 

n) y = y { i h- 


L X Y 

= by (1.6.19) 

d c) 

= It^'^ldxfdxi^ 0 by ( L6 - 13 )> Lemma L6 - 3 


dY l 


d 


dXY d 


= Qffk X + Yi (-TYT)XTP since ^ = id, T J-t it=0 = -X 


dxi 


dx i dxi 


dt 




dx k 
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Lxv = —(iplu)\ t =o 
H dilfi 

= TA“M)^dx k )\ t=0 by (1.6.14) 


dt 

du: 3 


dx i *%dx k + u~dx k , since V>o = id, f f K=o = X 


- ( dujj X i 


dX l 

dxi 


i)da 


□ 


In this manner, also Lie derivatives of arbitrary tensor fields may be com- 
puted. For example for h = hijdx 1 (g) dx 3 


L x h = hi X kX k dx l (g> dx 3 

, dX l , . , OX 3 , i , 

' ha ; -dx (g dx 3 + hij - , dx g dx 


dx k 

= {hij^X k + ft,, 


kj 


dX k 
dx * 


y ctr fc 
dX k 


+ hi kx — r )dx l ® da;- 7 
dx 3 


( 1 . 6 . 20 ) 


Remark. For vector fields X,Y,Z and '0 = ip t , the local flow of X, Lemma 
1.6.6 yields by differentiation at t = 0 

L x [F,Z] = [L x y,Z] + [y,L Y Z], 

and with Theorem 1.6.4 ii), we then obtain the Jacobi identity 

[X, [Y,Z]] = [[X,Y\,Z\ + [Y,[X,Z}\ 

= ~[Z, [X,Y]]-\Y,[Z,X]]. 


Definition 1.6.7 Let M carry a Riemannian metric 

g = gijdx 1 (g dx 3 . 

A vector field X on M is called a Killing field or an infinitesimal isometry if 

L x (g) = 0. (1.6.21) 


Lemma 1.6.7 A vector field X on a Riemannian manifold M is a Killing 
field if and only if the local l -parameter group generated by X consists of local 
isometries. 


Proof. From (1.6.21) 


= °- 


( 1 . 6 . 22 ) 
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Since this holds for every point of M, we obtain 

i p^g = g for all t G I. 

Therefore, the diffeomorphisms f/’t are isometries. Conversely, if the ift are 
isometries, (1.6.22) holds, hence also (1.6.21). □ 

Lemma 1.6.8 The Killing fields of a Riemannian manifold constitute a Lie 
algebra. 

Proof. The space of all vector fields on a differentiable manifold constitute a 
Lie algebra by Corollary 1.6.3. The claim then follows if we show that the 
space of Killing fields is closed under the Lie bracket [.,.], i.e. that for any two 
Killing fields X and Y, [X, Y] is again a Killing field. This, however, follows 
from the following identity which was derived in the proof of Theorem 1.6.4 
ii): 

[X,Y] = L X Y = j t d1>- t Y(tlH) | t=0 , 

where is the local group of isometries generated by X. Namely, for 

any fixed t, 

ip-t °Vs° i/>t, 

is the local group for drp-tY(ip t ), where (ip s ) sG i is the local group generated 
by Y. Since ift and ip s are isometries, so are ip-t. o ip s o ip t . 

It follows that 

L[X,Y]9 = g\s=t=o = 0 . 

Thus, [X, Y) indeed is a Killing field. □ 


1.7 Lie Groups 

Definition 1.7.1 A Lie group is a group G carrying the structure of a dif- 
ferentiable manifold or, more generally, of a disjoint union of finitely many 
differentiable manifolds for which the following maps are differentiable: 

GxG-tG (multiplication) 

(g,h) >->• g -h 

and 

G — > G (inverse) 

9 ^ 9~ 1 ■ 

We say that G acts on a differentiable manifold M from the left if there 
is a differentiable map 
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G x M -> M 
(g, x) i-> gx 

that respects the Lie group structure of G in the sense that 

g(hx) = ( g ■ h)x for all g,h € G, x G M. 

An action from the right is defined analogously. 

The Lie groups we shall encounter will mostly be linear algebraic groups. 
In order to describe the most important ones, let F be a vector space over R. 
of dimension n. We put 

G1(F) := {A : V — > F linear and bijective}, 

the vector space isomorphisms of F. 

If F is equipped with a scalar product (•,•), we put 

0(F) := {A G G1(F) : (Av,Av) = (v,v) for all v G F.} 

and 

SO (F) :={A G 0(F) : the matrix {Aei,ej)ij=\ t ..^ n has positive 

determinant for some (and hence any) basis ei, ..., e„ of F}. 

(In the terminology of § 2.1 below, one might express the last condition 
as: A transforms positive bases into positive bases.) Clearly SO(F) C 0(F). 
G1(F), SO(F) and 0(F) become Lie groups w.r.t. composition of linear maps. 
Since bijectivity is an open condition, the tangent space to G1(F), for example 
at the identity linear map, i.e. the Lie algebra of G1(F), can be identified with 

g[(F) := {X : V — > F linear}, 

the space of endomorphisms of F. The Lie algebra bracket is simply given by 

[X, Y] = XY — YX. 

The Lie algebra of SO(F) then is obtained by differentiating the relation 
( Av,Aw ) = (v,w), i.e. as 

so(F) := {X G 0((F) : (Xv,w) + (v,Xw) = 0 for all v,w G F}, 

the skew symmetric endomorphisms of V. (Of course, this is also the Lie 
algebra of 0(F), and therefore in the sequel, we shall sometimes write o(F) 
in place of so(F).) 

The relation between a Lie algebra and its Lie group is given by the 
exponential map which in the present case is simply 


e x =ld + X+^X 2 + i X 3 + . . . 
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For ieR, we have 

e tx = Id + tX + f ^X 2 + ... 

As the ordinary exponential map converges, this series converges for all t G R, 
and e tA is continuous in t. 

For s,t£l, we have 

e (s+t)x = e sx e tx 

In particular 

e x e~ x = Id 

Therefore, e x is always invertible, i.e. in G1(V), with inverse given by e~ x . 
Thus, for each X G fl((E), 

t ^ e tx 

yields a group homomorphism from R to G1(V). 

We assume that (-,■) is nondegenerate. Every X G g[(E) then has a 
adjoint X* characterized by the relation 

(Xv,w) = (v,X*w) for all v,w G V. 


With this notation 

X G so(V) ■<=>• X = —X*. 

For X G so(V), then 

(e' Y )* = Id + X* + ^(X*) 2 + . . . 

= ld-X+ ^X 2 - . . . = e~ x = (e x ) -1 , 

hence e x G SO(V). 

In fact, the exponential map maps so(V) onto SO(V). However, the ex- 
ponential map from gl(V) is not surjective; its image does not even contain 
all elements of G1+(V), the subgroup of automorphisms of V with positive 
determinant (w.r.t. some basis). 

Typically, (V, (•,•)) will be the Euclidean space of dimension n, i.e. R" 
with its standard Euclidean scalar product. For that purpose, we shall often 
use the notation Gl(n,R) in place of G1(V), fll(n), O(n), SO(n), o(n), so (n) 
in place of fll(E), 0(E), SO(E), o(H), so(E) etc. 

Sometimes, we shall also need complex vector spaces. Let Vc be a vector 
space over C of complex dimension m. We put 

Gl(Vc) := {A : Vc — » Vc complex linear and bijective}. 
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If Vc is equipped with a Hermitian product (•,•), we put 

U(Vc)(:= U(Vfc, (•,•))) := {A € Gl(Vc) : ( Av,Aw ) = (v,w) for all v,w G Vc} 
SU(H C ) := {A G U(V C ) : det A = 1 }. 

The associated Lie algebras are 

gl(Vc) := {X : Vc —> ► Vc complex linear} 
u(Vfc) := {X G 0[(Vfc) : (Xv,w) + (v,Xw) = 0 
for all v, w G Vc}, 

(the skew Hermitian endomorphisms of Vc), and 

su(Vc) := {X G u(Vfc) : trX = 0} 

(the skew Hermitian endomorphisms with vanishing trace), where the trace 
tr is defined using a unitary basis e\, . . . ,e m of Vc, i.e. (e*, ej) = Sjj. 

If V is C m with its standard Hermitian product, we write Gl(m, C), U(m), 
SU(m) etc. in place of Gl(Vfc), U(Vc), SU(Vc) etc. 

For A, B G G1(H), we have the conjugation by A. 

Int (A)B = ABA’ 1 (1.7.1) 

For X G 0 l(V), then the induced action of A is given by 

(AdH)X = AX A- 1 , 

and for Y G 0 i(H), we obtain the infinitesimal version 
(ad Y)X = YX -XY = [Y, X] 

as follows by writing B = e tx , A = e sY and differentiating (1.7.1) w.r.t. t 
and s and s = t = 0 . 

Thus, Ad and ad associate to each element in G1(V) resp. 0 l(H) a linear 
endomorphism of the vector space gl(H). Thus, Ad and ad yield representa- 
tions of the Lie group G1(V) and the Lie algebra g[(V), resp., on the vector 
space g[(V). These representations are called adjoint representations. 

The unit element of a Lie group G will be denoted by e. 

For g £ G, we have the left translation 

L g :G-^G 
h i— > gh 

and the right translation 

R g : G —> G 

h i— > hg. 

L g and R g are diffeomorphisms of G, (L g ) _1 = L g - 1 . 
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A vector field X on G is called left invariant if for all g,h € G 

L g ,X(h) = X{gh), 

(see (1.6.8) for the definition of L g *; note that we should write (L fl )* for L g *) 

i.e. 

L g *X = XoL g . (1.7.2) 

Theorem 1.7.1 Let G be a Lie group. For every V £ T e G, 

X(g):=L g *V (1.7.3) 

defines a left invariant vector field on G, and we thus obtain an isomorphism 
between T e G and the space of left invariant vector fields on G. 

Proof. 

X (gh) L^gh'jj.V Lg^Lft^V L g *X(Jf) 

which is left invariance. 

Since a left invariant vector field is determined by its value at any point 
of G, for example at e, we obtain an isomorphism between T e G and the space 

of left invariant vector fields. □ 

By Lemma 1.6.6, for g £ G and vector fields X,Y 

[L g ,X,Lg,Y]=L g ,[X,Y]. (1.7.4) 

Consequently, the Lie bracket of left invariant vector fields is left invariant 

itself, and the space of left invariant vector fields is closed under the Lie 
bracket and hence forms a Lie subalgebra of the Lie algebra of all vector 
fields on G (cf. Corollary 1.6.3). 

From Theorem 1.7.1, we obtain 

Corollary 1.7.1 T e G carries the structure of a Lie algebra. □ 

Definition 1.7.2 The Lie algebra g of G is the vector space T e G equipped 
with the Lie algebra structure of Corollary 1.7.1. 

We may easily construct so-called left invariant Riemannian metrics on 
a Lie group G by the following procedure: 

We select a scalar product (-, •) on the Lie algebra T e G. 

For h £ G,V £ ThG , there exists a unique V e £ T e G with 

V = L h *V e (1.7.5) 


since L] x is a diffeomorphism. 
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We then put for V, W £T h G 

{V,W) := {V e ,W e ). (1.7.6) 

This defines a Riemannian metric on G which is left invariant. 

In analogy to the definition of a vector bundle (Definition 1.5.1) where 
the fiber is a vector space we now define a principal bundle as one where the 
fiber is a Lie group. 

Definition 1.7.3 Let G be a Lie group. A principal G -bundle consists of a 
base M, which is a differentiable manifold, and a differentiable manifold P, 
the total space of the bundle, and a differentiable projection n : P — » M, 
with an action of G on P satisfying: 

(i) G acts freely on P from the right: (q, g) G P x G is mapped to qg G P, 
and qg ^ q for g ^ e. 

The G-action then defines an equivalence relation on P : p ~ q : -4=> 
3g G G : p = qg. 

(ii) M is the quotient of P by this equivalence relation, and 7 t : P —> M 
maps q £ P to its equivalence class. By (i), each fiber 7r _1 (a:) can then 
be identified with G. 

(iii) P is locally trivial in the following sense: 

For each x G M, there exist a neighborhood U of x and a diffeomor- 
phism 

(p : — > U x G 

of the form ip(p) = (7r (p),ip(p)) which is G-equivariant, i.e. p(pg) — 
(■jr(p),ip(p)g) for all g G G. 

As in Definition 1.5.2, a subgroup H of G is called the structure group of 
the bundle P if all transition maps take their values in H. Here, the structure 
group operates on G by left translations. 

The notions of vector and principal bundle are closely associated with 
each other as we now want to explain briefly. Given a principal G-bundle 
P — > M and a vector space V on which G acts from the left, we construct 
the associated vector bundle E — > M with fiber V as follows: 

We have a free action of G on P x V from the right: 

P xV x G -> P xV 
(p,v) ■ g = (p ■ g,g~ 1 v) . 

If we divide out this G-action, i.e. identify (p,v) and (p. v) ■ g, the fibers of 
(P x V)/G — » P/G become vector spaces isomorphic to V, and 


E := P x G V := (P x V) /G -► M 
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is a vector bundle with fiber G XqV := {G x V)/q = V and structure group 
G. The transition functions for P also give transition functions for E via the 
left action of G on V. Conversely, given a vector bundle E with structure 
group G, we construct a principal G-bundle as 

n u a x g / ~ 

a 

with 

( x a ,g a ) ~ (xp, gp) : x a = xp G U a n Up and gp = g>p a (x)g a 

where {U a } is a local trivialization of E with transition functions <pp a , as in 
Theorem 1.5.1. 

P can be considered as the bundle of admissible bases of E. In a local 
trivialization, each fiber of E is identified with R”, and each admissible basis 
is represented by a matrix contained in G. The transition functions describe 
a base change. 

For example, if we have an SO(n) vector bundle E, i.e. a vector bundle 
with structure group SO(n), then the associated principal SO(n) bundle is 
the bundle of oriented orthonormal bases (frames) for the fibres of E. 


Perspectives. Lie groups, while only treated relatively briefly in the present text 
book, form a central object of mathematical study. An introduction to their ge- 
ometry and classification may be found in [116]. As symmetry groups of physical 
systems, they also play an important role in modern physics, in particular in quan- 
tum mechanics and quantum field theory. 

We shall encounter Lie groups again in chapter 5 as isometry groups of sym- 
metric spaces. A theorem of Myers-Steenrod says that the isometry group of a 
Riemannian manifold is a Lie group. For a generic Riemannian manifold, the isom- 
etry group is discrete or even trivial. A homogeneous space is a Riemannian man- 
ifold with a transitive group G of isometries. It may thus be represented as G/H 
where H := {g £ G : gx o = xo} is the isotropy group of an arbitrarily selected 
xo £ M. Homogeneous spaces form important examples of Riemannian manifolds 
and include the symmetric spaces discussed in chapter 5. 


1.8 Spin Structures 

For the definition of the Dirac operator in §3.4 and its applications in Chapter 
7, we need a compact Lie group, Spin(n), which is not a subgroup of Gl(?r, R), 
but rather a two-fold covering of SO(n) for n > 3. The case n = 4 will be 
particularly important for our applications. In order to define Spin(n), we 
start by introducing Clifford algebras. 

We let V be a vector space of dimension n over R, equipped with a 
positive definite inner product (•,•). We put ||u|| := (v,v)z, for every v € V. 
For a substantial part of the algebraic constructions to follow in fact a not 
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necessarily nondegenerate quadratic form on V would suffice, but here we 
have no need to investigate the most general possible construction. On the 
contrary, for our purposes it suffices to take R" with its standard Euclidean 
scalar product. An orthonormal basis will be denoted by ei, ... ,e n . 

Definition 1.8.1 The Clifford algebra C\(V), also denoted Cl(n), is the quo- 
tient of the tensor algebra ® V ® . . . <g) V generated by V by the two sided 

fc >0 

ideal generated by all elements of the form v <S> v + ||i>|| 2 for v £ V. 

Thus, the multiplication rule for the Clifford algebra C1(V) is 

vw + wv = — 2(v, w) (1.8.1) 

In particular, in terms of our orthonormal basis e \, . . . , e„, we have 

e, 2 = —1 and e*ej = — e^ej for i ^ j. (1.8.2) 

From this, one easily sees that a basis of C1(V) as a real vector space is given 
by 

Co • 1, Cq. . & ol \ Cq, 2 * * • Cctk 

with a = {a \, . . . , oq.} C {1, . . . , n} and aq < 02 • • ■ < Ofc- For such an a, we 
shall put |a| := k in the sequel. Thus, as a vector space, C1(V) is isomorphic 
to A*{V) (as algebras, these two spaces are of course different). In particular, 
the dimension of C1(V) as a vector space is 2 n . Also, declaring this basis as 
being orthonormal, we obtain a scalar product on C1(V) extending the one 
on V. 

We define the degree of e a as being |a|. The e a of degree k generate the 
subset Cl fc (C) of elements of degree k. We have 

Cl° = M 

Cl 1 = V. 


Finally, we let C\ ev (V) and C\ odd (V) be the subspaces of elements of even, 
resp. odd degree. The former is a subalgebra of C1(V), but not the latter. 

Lemma 1.8.1 The center of C1(V) consists of those elements that commute 
with all v £ C1 1 (E) = V. For n even, the center is Cl°(V r ), while for n odd, 
it is Cl°(V)©Cr(V). 

Proof. It suffices to consider basis vectors e a = e ai . . . e ak as above. For 
j fL a, we have 

c ( y Cj — ( 1)1 I Cje a , 

and thus |a| has to be even for e a to commute with e,j, while 

e e — ( — lV a ^ p 
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so that | a | needs to be odd for a commutation. 

The conclusion follows easily for monomials and with a little algebra also 
in the general case. □ 

We next observe that 

Cl 2 =: spin(V) (or simply spin(n)) 

is a Lie algebra with the bracket 

[a,b] = ab — ba (1.8.3) 

For that, note that [a, 6] € Cl 2 (y) if as, b G Cl 2 (y) as an easy consequence of 

(1.8.2). 

To verify this, let us first consider the case 
a e&j, b — G k G[. 
with the indices i,j , k, l all different. In this case 


CiCjCk^l 

= e k eieiej - e k eieiej = 0 by (1.8.2) 


Another case is 


Then, using (1.8.2) 


CL — G% Gj , b — Gj G k . 


GiGjGjGk CjCk^iCj — ^i^k 

— Ci&k 6k&i 
= -2GiG k G Cl 2 (y). 

From these two cases, the general pattern should be clear. 

In a similar manner, the bracket defines an action r of Cl 2 (V r ) on Cl 1 (V) = 
V: 

r(a)v := [a,v] := av — va (1.8.4) 

Again, by (1.8.2) [o,v] G Cl^F) if a G Cl 2 (^), v G Cl^F). 

Let us consider the two typical cases as before, first 


CL — GiGj , V — 6 /-, 

with i,j,k all different. Then 

G^GjG k G k G^Gj GiGjG k GiGjG k 0 . 


The second case is 


CL — Si Gj , 


v = e*, 
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Then 


— e^CiCj — 2ej G Cl (P). 

Lemma 1.8.2 r defines a Lie algebra isomorphism between spin(P) and 
so{V). 

Proof. Since, as noted, r(a) preserves V, and since one readily checks that 
r[o, b] = [r(a),r(6)], r defines a Lie algebra homomorphism from spin(F) = 
C1 2 (V) to 0 [(V). For a G C1 2 (P), 

(t(cl)v,w) + (v,t(cl)w) = ~^[[a,v],w] - *[ v , [a,w]] by (1.8.1) 

= 0 (1.8.5) 

as one easily checks by employing (1.8.2), after the same pattern as above. 

Therefore, r(a) G so(P) for all a G C1 2 (P). It follows from Lemma 1.8.1 
that r is injective on C1 2 (P). Since C1 2 (P) and so both are vector spaces of 
dimension n( '" 2 ~ l ^ , and r is an injective linear map between them, r in fact 
has to be bijective. □ 


In the Clifford algebra C1(P), one can now define an exponential series as 
in p[(P), and one may define the group Spin(P) as the exponential image of 
the Lie algebra spin(V). Spin(P) then becomes a Lie group. This follows from 
general properties of the exponential map. Here, however, we rather wish to 
define Spin(r>) directly, as this may be more instructive from a geometric 
point of view. 

For that purpose, let us first introduce an anti-automorphism a i— > a 4 of 
C1(P), defined on a basis vector e ai e a2 . . . e ak as above by 

(e ai e a2 . . . Cafe) • • • ^'CK2^'CU ( ( 1) 2 ^ai ^a2 * • • ^ctfc) (1.8.6) 


In particular 


f 1 if A: is even 
l—l if A: is odd 


Also, for all a, b G C1(V) 


(o6)* = 6*0*. 


(1.8.7) 

( 1 . 8 . 8 ) 


Definition 1.8.2 Pin(P) is the group of elements of C1(V) of the form 


a = a± . . . ak with o* G V, ||aj|| = 1 for i = 1, . . . , k 

Spin(V) is the group Pin(P) nCl e,; (P), i.e. the group of elements of Cl(u) of 
the form 


a = Oi . . . a 2 m with Oi G V, || a* || = 1 for * = 1, ... , 2m (to G N). 
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We shall often write Pin(n), Spin(n) in place of Pin(R"), Spin(R ra ), resp. 

From (1.8.7), we see that Spin(V) is the group of all elements a € Pin(P) 
with 

aa* = 1 (1.8.9) 


Theorem 1.8.1 Putting 

p(a)v := ava 4 

defines a surjective homomorphism p : Pin(P) — > O(V) with p(Spin(P)) = 

SO(V’). 

In particular, Pin(P) C C1(V) acts on V. This is the so-called vector 
representation, not to be confused with the spinor representation introduced 
below. 

Proof. We start with a G V, ||a|| = 1. In that case, every v € V decomposes 
as 

v = Xa + a -1 , with (a, a -1 ) = 0, A G R. 

Then, since a = a* for a € V 

p(a)v = a(Xa + a~ L )a 
= —A a — aaa 1 - 

= —Xa + a- 1 . 

Consequently p(a) is the reflection across the hyperplane orthogonal to a. 
This is an element of O(V). Then also for a general a = a\ ... G Pin(P), 
p(a) is a product of reflections across hyperplanes, hence in O(V). The pre- 
ceding construction also shows that all reflections across hyperplanes are 
contained in the image of p(Pin(P)). Since every element in O(V) can be rep- 
resented as a product of such reflections 1 , it follows that p(Pin(P)) = O(V). 
If now a G Spin(P), then p{a) is a product of an even number of re- 
flections, hence in SO(F). Since every element SO(V) can conversely be 
represented as a product of an even number of reflections, it follows that 
p(Spin(V)) = SO(V). 

From (1.8.8), it is clear that p(ab) = p(a)p(b), and so p defines a homo- 
morphism. □ 

Let us now determine the kernel of 

p : Spin(V') — ► SO(V). 

1 Every rotation of a plane is a product of two reflections, and the normal form of 
an orthogonal matrix shows that it can be represented as a product of rotations 
and reflections in mutually orthogonal planes. 


since aa = aaf = —1 by (1.8.7) 
and a ± a + aa ^ = 0 by (1.8.2) 
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If a £ ker p, then p{a)v = v for all v £ V. From the definition of p and aa l = 1 
for a € Spin(V), we obtain that this is equivalent to 

av = va for all v £ V, 

i.e. a commutes with all elements of V. Since all elements in Spin(H) are 
even, Lemma 1.8.1 implies atR. Since aaf = 1, we conclude that 

a = ±1. 

We next claim that Spin(H) is connected for diiripF > 2. Let 

a = . . . a 2 m G Spin(H), with at in the unit sphere of V. (1.8.10) 

Since that sphere is connected, we may connect every cq by a path ai{t) to 
ei. Hence, a can be connected to ei . . . ei (2m times), which is ±1. Thus we 
need to connect 1 and —1. We use the path 

7 (t) = (cos ei + sin f^t) e 2 ) (cos f^t ) e x - sin e 2 ) 

= -cos^(|t) + sin^(|t)-2sin(|t)cos(|t)e 1 e 2 

since eiei = e 2 e 2 = —1. 

This path is contained in Spin(H) and satisfy 7 (0) = —1, 7 (1) = 1, and we 
have shown connectedness of Spin(H) for diniRH > 2. 

(1.8.10) also easily implies that Spin(H) is compact. If we finally use the 
information that 7 Ti(SO(H)) = Z 2 for n = diniRV > 3, we obtain altogether 

Theorem 1.8.2 p : Spin(H) — »■ SO(V) is a nontrivial double covering. 
Spin(H) is compact and connected, and for dimRH > 3, it is also simply 
connected. Thus, for diniRV > 3, Spin(H) is the universal cover of SO(V). 

□ 

Let us briefly return to the relation between spin(H) and Spin(H). If we 
differentiate the relation characterizing Spin(y), i.e. 

aaf = 1 and ava f £ V for all v £ V, 

(differentiating means that we consider a = 1 + eb + 0(e 2 ) and take the 
derivative w.r.t. e at e = 0), we obtain the infinitesimal relations 

b + b* = 0 and bv + vb t = bv — vb for all v £ V, 

which were the relations satisfied by elements of spin(H) = C1 2 (H). Since 
the preceding implies that Spin(H) and spin(H) have the same dimension, 
namely the one of SO(V) and so(H), i.e. ra(?l 2 ~ 1 ^ , spin(H) indeed turns out to 
be the Lie algebra of the Lie group Spin(F). 

Let us also discuss the induced homomorphism 
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dp : spin(F) — > so(F), 

the infinitesimal version of p. The preceding discussion implies that dp coin- 
cides with the Lie algebra isomorphism r of Lemma 1.8.2. In order to obtain 
a more explicit relation, we observe that a basis for so(n), the Lie algebra of 
skew symmetric n x n-matrices is given by the matrices e* A ej , 1 < i < j < n, 
(denoting the skew symmetric matrix that has —1 at the intersection of the 
i th row and the j th column, +1 at the intersection of the j th row and the i th 
column, and 0 entries elsewhere) 2 , e j A e : j is the tangent vector at the identity 
of SO(n) for the one parameter subgroup of rotations through an angle '0 
in the plane from e, towards ej. In Spin(n), we may consider the one 
parameter subgroup 

d i— > e,(— cos (d)ei + sin(i?)ej) = cos(d) + sin($)e.jej. 

Its tangent vector at 1, i.e. at d = 0, is ejCj. 


Lemma 1.8.3 


dp(eiej) = 2(ej A ej) 


Proof. We have seen in the proof of Theorem 1.8.1, that p(a) is the reflection 
across the hyperplane perpendicular to a, for a unit vector a £ R". Thus, 
p(cos(d) + sin (d)ejej) is the reflection across the hyperplane orthogonal to 
— cos (d)ei + sin(d)ej followed by the one across the hyperplane orthogonal 
to ej. This, however, is the rotation in the ej,ej plane through an angle of 
2d from e, towards ej. □ 

Examples. 

1) From its definition, the Clifford algebra C1(M) is R[x]/(a; 2 + 1), the 
algebra generated by x with the relation x 2 = —1. In order to make 
contact with our previous notation, we should write e\ in place of x. 
Of course this algebra can be identified with C, and we identify the 
basis vector e\ with i. Cl eu (R) = C1°(R) then are the reals, while 
C1 ocM (R) = C1 1 (R) is identified with the purely imaginary complex 
numbers. Pin(R) then is the subgroup of C generated by ±i, and 
Spin(R) is the group with elements ±1. 

2) C1(R 2 ) is the algebra generated by x and y with the relations 

x 2 = -1, y 2 = -1, xy = -yx. 

Again, we write ei, e-i in place of x, y. This algebra can be identified 
with the quaternion algebra H, by putting 

i = e\, j = e-iy k = e^. 


2 For the sake of the present discussion, we identify V with R" ( n = dirng V). 
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Since i 2 = j 2 = k 2 = — 1, ij + ji = ik + ki = jk + kj = 0 
the relations (1.8.2) are indeed satisfied. The subalgebra Cl ei; (K 2 ) 
is generated by k, and thus it is isomorphic to C C Bt, where 
the purely imaginary complex numbers correspond to multiples of 
k. Pin(R 2 ) is generated by the circle cos($)* + sin($)j through i 
and j ('(} € S' 1 ). Spin(R 2 ) then is the group consisting of prod- 
ucts (cos($i)i + sin(i?i)j)(cos(i? 2 )* + sin(t? 2 )j) ($i,$ 2 € S 1 ) = 

— cos cos t?2 — sin $i sin t? 2 + (cos i9i sin i9 2 — cos $2 sin i9i)k, i.e. the 
unit circle in the above subspace C C H. (So, while Pin(V) is gener- 
ated by 1 Spin(V) is generated by 1 ,k. i and j act on R 2 by 

reflection while k acts as a rotation.) Thus, Spin(R 2 ) is isomorphic 
to U(l) = S 1 . We should note, however, that it is a double cover of 
SO (2) as ±1 both are mapped to the trivial element of SO (2). 

3) Similarly, we identify C1(R 3 ) with El © H by putting 

e 0 = (l,l), ei = (i,-i), e 2 = (j,-j) e 3 = (k,-k). 


Then 

ei e 2 = (k,k), e 2 e 3 = (i,i), e 3 e 1 = (j,j), 

and Cl el ’(M 3 ) is identified with the diagonal embedding of EI into 
H©EI. Since C1 1 (R 3 ) = R 3 is identified with the pairs (a, — a ) of purely 
imaginary quaternions a, Pin(R 3 ) is generated by such elements of 
length 1. Spin(R 3 ) then is the group of pairs (j3, (3) of unit quaternions 
f3, as every such pair can be obtained as a product («i, — ai)(a 2 , — ct 2 ) 
where a.\, a 2 are purely imaginary unit quaternions themselves. Thus, 
Spin(R 3 ) is isomorphic to the group Sp(l) of unit quaternions in EL 
One also knows that this group is isomorphic to SU(2). In fact, we 
have a natural linear embedding 

7 : EI C 2x2 (2 by 2 matrices with complex coefficients) 
by writing w G EI as 

w = (wq + kw\) + j(w 2 + kw 3 ) = ui + jip 


with Wo,Wi,w 2 ,w 3 € 
and putting 


Then 


while we consider u> and ip as elements of C, 




7(1) = 


0 


0 


7 GO = 


0 -1 

1 0 


7(fc) = 


i 0 
0 -i 


These matrices satisfy the same commutation relations as i,j,k, and 
7 (ww 1 ) = 'y(w)j(w'), y(w) = j(w)* 
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4) 


for all w,w' G H. Thus, 7 is an algebra homomorphism. 7 induces an 
isomorphism between Sp(l) and SU(2). 

C1(R 4 ) is identified with H 2x2 , the space of two by two matrices with 
quaternionic coefficients, by putting 


eo 




e 3 




Pin(R 4 ) is generated by the unit sphere in C1 1 (K 4 ) = K 4 , i.e. in our 
identification by all linear combinations of ei 62, e 3, e 4 of unit length. 
Spin(R 4 ) then is the group of products of two such elements, i.e. the 

group of all elements of the form ^ ^ ^ ^ where a and /? are unit 

quaternions. Thus, Spin(R 4 ) is homeomorphic to S 3 x S' 3 = Sp(l) x 
Sp(l) = SU(2) x SU(2). From Theorem 1.8.2, we then infer that 


SO(4) Spin (4) /Z 2 * (SU(2) x SU(2))/Z 2 


In the sequel, we shall also need the complex Clifford algebra and the cor- 
responding spin group. For V as before, we denote the complexified Clifford 
algebra by 

Cl c (C) = C1(V) 0 R C. 

Thus, the e a again form a basis, and the only difference is that we now admit 
complex coefficients. 

For the sequel, we need to choose an orientation of V , i.e. select an (or- 
thonormal) basis e±, ... ,e n of V being positive. (Any other basis of V ob- 
tained from this particular one by an element of SO(V) then is also called 
positive.) 

Definition 1.8.3 Let ei, . . . , e n be a positive orthonormal basis of V. The 
chirality operator is 

r = i m ei ,...,e n GCf(V) 

with m = for even n, m = for odd n. 

It is easy to check that T is independent of the chosen positive orthonor- 
mal basis. To see the mechanism, let us just consider the case n = 2, and the 
new basis /1 = cosrlei + sin$e 2 , / 2 = — sin$ei +cos$e 2 . Then 

/i/ 2 = — sint? cos -deiei + sin-dcos'de 2 e 2 + cos 2 deie 2 — sin 2 ■de 2 ei 
= eie 2 by (1.8.2) 
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Lemma 1.8.4 T 2 = 1. 

For odd n, rv = vT for all v £ V. 

For even n, rv = —vT for all v £ V. 

Proof. A simple computation based on (1.8.2) □ 

Thus, we may use T to obtain a decomposition 

Cl^V)* 

of Cl c (C) into the eigenspaces with eigenvalue ±1 under multiplication by 
r. This is particularly interesting for even n, because we have 

uCl c (y) ± = Cl c (V0 T for every v £ V \ {0}, 

i.e. Clifford multiplication by v interchanges these eigenspaces. This is a sim- 
ple consequence of Lemma 1.8.4, namely if e.g. 


.Ta = a 


then 

rva = —vTa = —va. 

Definition 1.8.4 Spin c (C) is the subgroup of the multiplicative group of 
units of Cl c (C) = C1(V) <g> C generated by Spin(V) and the unit circle in C. 

Lemma 1.8.5 Spin c (C) is isomorphic to SpinC Xz 2 S 1 , where the Z2 action 
identifies ( a,z ) with (— a,—z ). 

Proof. By Lemma 1.8.1, the unit complex scalars are in the center of Cl c (V), 
and hence commute with Spin(l / ). Therefore, we obtain a map 

Spin(C) x S 1 -> Spin c (C), (1.8.11) 

which is surjective. The kernel of this mapping are the elements (a, z) with 
az = 1, which means a = z £ Spin(C) fl S 1 . We have already seen in 
the preparations for Theorem 1.8.2 that this latter set consists precisely of 
± 1 . ' □ 

By Lemma 1.8.5 changing (a, z) to (—a, z) amounts to the same as chang- 
ing (a, z) to ( a,—z ), and thus we obtain an action of Z2 on Spin c (y). The 
quotient of Spin c (C) by this action yields a double covering 

Spin c (V0 -► SO(V) x S 1 (1.8.12) 

that is nontrivial on both factors. 

The maps given in (1.8.11), (1.8.12) allow to determine the fundamental 
group 7Ti(Spin c (V r )). Namely, a homotopically nontrivial loop 7 in S' 1 induces 
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a loop in Spin°(U) that is mapped to the loop 2y in S 1 by (1.8.12) (2y means 
the loop 7 traversed twice) which again is nontrivial. Thus, 7Ti(Spin c (U)) 
contains 717 (S' 1 ) = Z as a subgroup. On the other hand, if we have a loop 
in Spin c (U) that is mapped to a homotopically trivial one in S' 1 when we 
compose (1.8.12) with the projection on the second factor, it is homotopic to 
a loop in the kernel of that composition. That kernel can be identified with 
Spin(U) by (1.8.11), and since Spin(U) is simply connected by Theorem 1.8.2 
for dim V > 3, such a loop is homotopically trivial for dim V > 3. Thus 

Theorem 1.8.3 For dim V > 3 

7n(Spin c (V)) = Z 


□ 

Examples. The treatment here will be based on the above discussion of ex- 
amples in the real case. 

1) Cl c (R) = C1(K) ®r C = C © C, and Spin c (R) = S 1 sits diagonally in 
this space. 

2) Cl c (M 2 ) = C1(R 2 ) ® R C = H <g> R C. We want to identify Cl c (R 2 ) with 
C 2x2 , the space of two by two matrices with complex coefficients. 
We consider the above homomorphism of algebras El — > C 2x2 , and 
extending scalars, we obtain an isomorphism of C-algebras 

EI®C -> C 2x2 . 

Thus, we identify C1 C (R 2 ) with C 2x2 . Under this identification, 
Spin(R 2 ) corresponds to the elements 

(0 5) with a e ^ = U ( X ) c C - 

Spin c (K 2 ) then consists of the unitary diagonal matrices, i.e. Spin c (R 2 ) 
= U(l) x U(l) = S 1 x S 1 . 

3) C1 C (K 3 ) = C1(K 3 )®C = (H©H)®C = C 2x2 ®C 2x2 from the preceding 
example. We have identified Spin(R 3 ) with SU(2), and so 

Spin c (R 3 ) S {e^U :i>eR,f/e SU(2)} = U(2) 

Similarly, C1 C (K 4 ) = C1(M 3 ) ® R C = H 2x2 ® C = C 4x4 . We have 
identified Spin(R 4 ) with SU(2) x SU(2), and so 

Spin c (K 4 ) = Spin(R 4 ) x Z2 S 1 

£* {([/, V) G U(2) x U(2) : det U = det V} 


4) 
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In order to describe the isomorphism C 1 C (R 4 ) = C 4x4 more explicitly, 
we recall the homomorphism 7 : H — > C 2x2 from the description of 
C 1 (K 3 ). We define 

r : H -> C 4x4 


via 


We recall 


r(w) = ( 0 7(w) ^ 

w V-7M* 0 )■ 


7(1) = 


7 U) = 


1 0 

0 1 

0 -1 

1 0 


7 (i) = 


7 (fc) = 


0 i 

1 0 


i 0 
0 -i I ' 


We identify M 4 with H, putting ei = 1 , ei = i, e% = j, e 4 = k. Then 


r(e 1 )r(e 2 ) = 


r( ei )r(e 3 ) = 


r(e 1 )r(e 4 ) = 


r(e 2 )T(e 3 ) = 


r(e 2 )T(e 4 ) = 


r(e 3 )T(e 4 ) = 


^0 i 
i 0 

V 

(0 -1 

1 0 

V 

I i 0 
0 -i 

V 

( i 0 

0 -i 

\ 

(° 1 

-1 0 

V 

1 0 i 

1 0 


0 -i 
-i 0 


0 -1 

1 0 


-* 0 

0 i . 


i 0 

0 -i, 


= -r(e 2 )r( ei ) 


= -r(e 3 )r(e 1 ) 


— —r(e 4 )r(e 1) 


= -r(e 3 )r(e 2 ) 


0 x \= -r(e 4 )r(e 2 ) 

-1 0 , 


V 


0 -i 
-i 0 


= -r(e 4 )r(e 3 ) 


(always with 0 ’s in the off diagonal blocks). One also easily checks 
that 
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r{e a )r(e a ) = —Id, for a = 1, 2, 3, 4. 

Thus, r preserves the relations in the Clifford algebra, and it is not 
hard to verify that T in fact extends to the desired isomorphism 
between Cl c (R 4 ) and C 4x4 . 

The preceding examples seem to indicate a general pattern that we now 
wish to demonstrate by induction on the basis of 

Lemma 1.8.6 For any vector space V as above 

Cl c (C © R 2 ) Cl c (C) 0 c Cl c (R 2 ) 


Proof. We choose orthonormal bases vi,...,v n ofV and ei, e 2 of R 2 . In order 
to define a map that is linear over R, 

l : V © R 2 -» Cl c (C) <g> c Cl c (R 2 ) 

we put 

l(vj) := ivj ®e 1 e 2 , j = 1, ... ,n 

l(e a ) := 1 0 e a , a = 1,2. 

Since for example 

l(vjVk + VkVj) = (- VjVk - VkVj) 0 eie2eie2 = VjVk + VkVj 0 1 

l{vj e a + e a Vj) = ivj 0 (eie 2 e a + e Q eie 2 ) = 0 for a = 1,2 

we have an extension of l as an algebra homomorphism 

l : C1(V © R 2 ) -> C1 C (C) 0c C1 C (R 2 ). 

Extending scalars from R to C, we obtain an algebra homomorphism 

l : Cf(V © R 2 ) Cl c (F) 0c Cl c (R 2 ). 

Now l has become a homomorphism between two algebras of the same di- 
mension, and it is injective (and surjective) on the generators, hence an iso- 
morphism. □ 


Corollary 1.8.1 

i) If dim R C = 2 n, Cl c (E) ^ C 2nx2n , 

ii) If dim R C = 2n + 1, Cl c (E) = C 2nx2n © C 2 " x2 ^ 


Proof. By example 2), Cl c (R 2 ) = C 2x2 , and the proof follows from Lemma 
1.8.6 by induction, starting with example 2) in the even and example 1) in 
the odd dimensional case, and using 


C” 


'0c C 2x2 


-<2mx2m 


□ 


We now wish to identify Cl c (C) for even dimensional V as the algebra of 
endomorphisms of some other vector space in a more explicit manner than in 
Corollary 1.8.1. We thus assume that n = diniRE is even, n = 2 m. We also 
choose an orientation of V, i.e. select a positive orthonormal basis ei, . . . , e n . 
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In V (g> C, we consider the subspace W spanned by the basis vectors 


Vj ■= ( e 2 j-i - ie 2j ), j = l,...,m. 

If we extend the scalar product (•, •} to V ® C by complex linearity, we have 


hence 


(Vii Vj)c = 0 for all j, 


( w , iv)c = 0 for all w £ W 


(1.8.13) 


(1.8.14) 


(One expresses this by saying that W is isotropic w.r.t. (-,-)c) 

We have 

V ® C = W © W, 

with W spanned by the vectors rfj = -^(e 2 j-i + ie 2 j), j = 1, . . . , to. Because 

of (1.8.14), W is the dual space W* of W w.r.t. he. for every w € 

W \ {0}, there exists a unique w' G W with ||w/|| = 1 and 

(w,w')c = ||w||. 


Definition 1.8.5 The spinor space S is defined as the exterior algebra AW 
of W. If we want to emphasize the dimension n of V, we write S n in place of 

5. 

We may then identify Cl c (l / ) as Endc(S') as follows: We write v € V®C 
as 

v = w + w' with w € W, w' £ W, 
and for s £ S = AW, we put 

p(w)s := y/2 e(w)s (= \/2w A s, as e denotes the exterior product) 

p{w')s := — \/ 2 t ( w')s (where l(w') denotes the interior product; note that 

we identify W with the dual space W* of W, c.f. 

§ 1-5) 

p obviously extends to all of Cl C (V r ) by the rule p(vw) = p(v)p(w). 

We have the following explicit rules for e(w) and i(w'): If s = rjj 1 A . . . r]j k , 
with 1 < ji < . . . < jk < m, then 

e (Vj) s = Vj Ar?ji A ... Ar] jk (= 0 if j £ {j x , . . . ,j k }), (1.8.15) 


and 

f0 _ if j t 

\ (-1 A . . . A T] U A . . . A T] jk if j = jp. 


In particular 


e(iy)u(rij)s 


0 if j i {ji,---,jk} 

s if j € {ji , . . , , jk} 


(1.8.16) 

(1.8.17) 
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Thus, we have for all s and all j 


s if j ^ {iu- ■ - ,jk} 
0 if j G {ji, . . . ,jk} 


+ L(Vj)t(Vj))s = s 


(1.8.18) 


(1.8.19). 


For subsequent use in 5.2, we also record that in the same manner, one sees 
that 

(e(Vj)i'(jJe) + = 0 for j ^ l. (1.8.20) 

In order to verify that the claimed identification is possible, we need to 
check first that p preserves the relations in the Clifford algebra. The following 
examples will bring out the general pattern: 



= -(e(mM?h) + t(r?i)e(r?i)), since e(?yi) 2 = 0 = (.(rq) 2 
= -1 by (1.8.19) 


p(eie 2 ) + p(e 2 e 1 ) = (efa) - t(p 1 ))i(e(p 1 ) + ifa)) 

+ *(e(?7i) + i-(Vi))(e(m) - t(f?i)) 

= 0 

p(eie 3 ) + p(e 3 ei) = (e(r?i) - t(?? 1 ))(e(t?2) - <-(%)) 

+ (e(%) - ife))(e(m) - t-iVi)) 

= (e(t/i)e(%) + e{p 2 )(e{m) 

+ 

~ (e(m M%) + t(i?2)e( ? 7i)) 

- (e(%)t(?7i) + t(i7i)e(%)) 

= 0 

since the 6(771), • ■ • , i(p 2 ) all anticommute. 

(e.g. e(?7i)t(?72) ? 72 A ?? 3 = 6(771)773 = Vi A 773 

<■(? h) e ( 7 li)V2 A 773 = t( 7 ? 2 )??i A ry 2 A 773 = -771 A 773.) 

Now dimcCl c (F) = 2 " = (dime (TIT)) 2 = dime (Endc(<S')), and since p has 
nontrivial kernel, we conclude 


Theorem 1.8.4 If n = diniRT 7$ e7;en, Cl c (T) is isomorphic to the algebra 
of complex linear endomorphisms of the spinor space S. □ 

(Later on, we shall omit the symbol p and simply say that Cl c (T) operates 
on the spinor space S via Clifford multiplication, denoted by “•”). Now since 


VjVj - I <2., 
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we have 

r = 2 ~ m (771 rj 1 - rj 1 771 ) . . . ( IlmVm - VmVm) 
and so T acts on the spinor space S = AW via 

P( r ) = (- 1 ) m (e(Pi)'-(Pi) - t(r? 1 )e(??i)) . . . (e(»? m )t(^ m ) - L(rJ m )e(r] m )), 

and for the same reasons as in the computation of p(ei 2 ), we see that p(T) 
equals (— l) fc on A k W . As above, any representation of C1 C (V), in particular 
p, decomposes into the eigenspaces of T for the eigenvalues ±1, and so in the 
present case we have the decomposition 

S ± := A ± W 

where the + (— ) sign on the r.h.s. denotes elements of even (odd) degree. 

Since Spin(V) sits in C1(V), hence in Cl c (E), any representation of the 
Clifford algebra C1 C (V) restricts to a representation of Spin(V), and we thus 
have a representation 

p : Spin(C) — > Endc(S'). 

Since Spin(C) C C1 + (F), Spin(C) leaves the spaces and S~ invariant, 
and thus the representation is not irreducible, but decomposes into the ones 
on S + and S' - . (The latter are in fact irreducible.) 

Definition 1.8.6 The above representation p of Spin(C) on the spinor space 
S is called the spinor representation , and the representations on S + and S~ 
are called half spinor representations. 

Note that the spinor space S = AW is different from the Clifford space 
C1(V) (= A*(V) as a vector space). C1(V), and therefore also V, acts on both 
of them by Clifford multiplication. 

We now want to extend the representation of Spin(V) to Spin c (C). 

Lemma 1.8.7 Let a : Spin(C) — * Endc(T) be a complex representation of 
Spin(V) on some vector space T, satisfying 

cr(-l) = -1. 

Then a extends in a unique manner to a representation 

a : Spin c (C) — > Endc(T). 


Proof. Since a is complex linear, it commutes with multiplication by com- 
plex scalars, in particular with those of unit length. Thus, a extends to 
a' : Spin(C) x S 1 — > Endc(T). Since er(— 1) = —1, it descends to Spin c (V r ). 

□ 
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Corollary 1.8.2 The spinor and half spinor representations o/Spin(V r ) pos- 
sess unique extensions to Spin c (H). □ 


Of course, this is also clear from the fact that these representations of 
Spin(V) come from C1 C (V). 

Let us discuss the example of C1 C (R 4 ) once more. We recall the isomor- 
phism 

r : Cl c (R 4 ) -> C 4x4 

r in fact is the representation described in Theorem 1.8.4, and C 4 is isomor- 
phic to S4,. The formulas given above for the products r(e a )r(ep ) also show 
that the representation admits a decomposition into two copies of C 2 that is 
preserved by the elements of even order of Cl c (R 4 ). In fact, these yield the 
half spinor representations £4 in dimension 4. In the above formulas, the 
upper left block corresponds to S + , the lower right one to S~ . 

In dimension 4, we also have a decomposition 

A 2 = A 2 ’ + © yl 2>_ (yl 2 = A 2 V* , d i m V = 4) 


of exterior two forms. Namely, we have the Hodge * operator (to be discussed 
in § 2.1 for arbitrary dimensions) determined by 

*(e l A e 2 ) = e 3 A e 4 
*(e 4 A e 3 ) = -e 2 A e 4 
*(e 4 A e 4 ) = e 2 A e 3 
*(e 2 A e 3 ) =e 1 Ac 4 
*(e 2 A e 4 ) = — e 1 A e 3 
*(e 3 A e 4 ) =e 1 A e 2 


and linear extensions, where e 1 , . . . , e 4 is an orthonormal frame in V* . 
We have 


** = 1 , 


and * thus has eigenvalues ± 1 , and yl 2,± then are defined as the corresponding 
eigenspaces. Both these spaces are three dimensional. A 2,+ is spanned by 
e 1 A e 2 + e 3 A e 4 , e 1 A e 3 — e 2 A e 4 , e 1 A e 4 + e 2 A e 3 , while yl 2, ~ is spanned 
by e 1 A e 2 — e 3 A e 4 , e 1 A e 3 + e 2 A e 4 , e 1 A e 4 - e 2 A e 3 . Elements of yl 2,+ are 
called selfdual, those of yl 2, ~ antiselfdual. 

We have a bijective linear map AV* — * C1 2 (H), given by e* A — > e* • e^ 

(where e* is the orthonormal frame in V* dual to the frame e, in V). 

Therefore, T induces a map T 1 : A 2 V* — > End(C 4 ). In the above decom- 
position of the representation of ci c,ev (M. 4 ) , the selfdual forms then act only 
on C 2 © {0}, while the antiselfdual ones act only on {0} ©C 2 , as one directly 
sees from the formulae for T(e Q )T(e / j) and the description of the bases of 
A 2 ’±. 
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Finally, let us briefly summarize the situation in the odd dimensional case. 
Here, according to Corollary 1.8.1, Cl c (H) is a sum of two endomorphism 
algebras, and we therefore obtain two representations of Cl c (H). When re- 
stricted to Spin(H), these representations become isomorphic and irreducible. 
This yields the spinor representation in the odd dimensional case. We omit 
the details. 

We also observe that the spinor representation is a unitary representation 
in a natural manner. For that purpose, we now extend the scalar product 
(•, •) from V to V 0 C as a Hermitian product, i.e. 

n n n 

(Y a i e i, Y A' e j) = Y ai Pi for ai, . . . , a n , Pi, . . . , /3 n e C. 

i — 1 j = 1 i — 1 

Note that this is different from the above complex linear extensions (-,-)c- 
This product extends to AV by letting the monomials e,;, A . . . e» fc , 1 < i\ < 
...<... ifc < n, constitute an orthonormal basis. From the above computa- 
tions for the p(ej), one checks that each p{ef) preserves (•,•), i.e. 

(p(ej)s, p(ej)s r ) = ( s,s' ) for all s, s' e AW. 

Of course, this then holds more generally for every v € V with ||u|| = 1, and 
then also for products V\ . . . V/- with ||uj|| = 1 for j = 1, ... ,k. This implies 

Corollary 1.8.3 The induced representation of Pin(H) and Spin(H) on 
Endc(S') preserves the Hermitian product (-,•)• □ 

Corollary 1.8.4 

( p(v)s,s ') = —(s,p(v)s'} for all s, s' € AW, v € V. 


Proof. We may assume ||v|| = 1. Then p{v) 2 = —1, hence 

(p(v)s,s') = —(p(v)s,p(v)p(v)s') = —(s,p(v)s') by Corollary 1.8.3. 


□ 

After these algebraic preparations, we may now define spin structures on 
an oriented Riemannian manifold M. We let TM be the tangent bundle of 
M. The Riemannian metric allows to reduce the structure group of TM to 
SO(n) (n = dim M), and we obtain an associated principal bundle P over M 
with fiber SO(n), the so-called frame bundle of M. 

Definition 1.8.7 A spin structure on M is a principal bundle P over M 
with fiber Spin(n) for which the quotient of each fiber by the center ±1 is 
isomorphic to the above frame bundle of M. A Riemannian manifold with a 
fixed spin structure is called a spin manifold. 
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In other words, we require that the following diagram commutes 


7T 

M 

where 7 r denotes the projection onto the base point, and p is the nontrivial 
double covering p : Spin(n) — ► SO(n) on each fiber as described in Theorem 
1.8.2. This is also expressed by saying that the frame bundle is lifted to a 
Spin(ro) bundle. It is important to note that such a lift need not always be 
possible. One way to realize this is by considering the corresponding transition 
functions. We recall from §1.5 that the frame bundle P for each trivializing 
covering (U a ) a€ A of M induces transition functions 

ipp a : U a O Up — > SO(ra) 



satisfying 


<Paa(x) = id 
VapWvpa = id 
p oll (x)fp 1 p(x)ipp a (x) = id 


for x £ U a 

for x € U a O Up 

for x e u a n Up n u 1 . 


Lifting the frame bundle to a Spin(ro) bundle then requires finding transition 
functions 

<P/ 3 a ■ U a D Up -> Spin(n) 


with 

p(& 0 a) = vpa for all/?, a (1.8.21) 

and satisfying the same relations as the <pp a . By making the U a suffi- 
ciently small, in particular simply connected, lifting the ipp a to p>p a satisfying 
(1.8.21), is no problem, but the problem arises with the third relation, i.e. 


ip a p{x)(p p 1 (x)(p ja (x) = id for x £ U a fl Up D Uj. (1.8.22) 

Namely, it may happen that p a p(x)ip/ 3 1 (x) and p ia (x) differ by the nontrivial 
deck transformation of the covering p : Spin(?r) — ■> SO(n). 

In fact, the existence of a spin structure, i.e. the possibility of such a 
lift, depends on a topological condition, the vanishing of the so-called Stiefel- 
Whitney class W 2 ^M) £ iJ 2 (M, Z 2 ). Here, however, we cannot define these 
topological concepts. Furthermore, if a spin structure exists, it need not to 
be unique. 

Let us assume that M possesses a spin structure P — > M. Since the fiber 
Spin(n) of P operates on the spinor space S n and for even n also on the half 
spinor spaces via the (half) spinor representations, we obtain associated 
vector bundles §„, over M with structure group Spin(n), 
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with 


• P - >< 'Spin(n) ^ni &n '■ — P '*'Spin(n) 


± 

n ’ 


S„ = ® §„ for even n. 


Definition 1.8.8 S„ is called the spinor bundle, S„ the half spinor bundles 
associated with the spin structure P. Sections are called (half) spinor fields. 


From Corollary 1.8.3, we infer that these bundles carry Hermitian prod- 
ucts that are invariant under the action of Spin(n), and even of Pin(n), 
on each fiber. In particular, Clifford multiplication by a unit vector in 
R" C C1(R") is an isometry on each fiber. We may also consider Spin°(n) in 
place of of Spin(n) and ask for a lift of the frame bundle P over M to a princi- 
pal Spin°(?r) bundle P c . Of course, the requirement here is that the map from 
a fiber of P c to the corresponding one of P is given by the homomorphism 

Spin c (n) — > SO(n) 

obtained from (1.8.12) by projecting onto the first factor. 

Definition 1.8.9 Such a principal Spin c (n) bundle P c (if it exists) is called 
a spin c structure on M. An oriented Riemannian manifold M equipped with 
a fixed spin 0 structure is called a spin c manifold. 


Again, the existence of a spin 0 structure depends on a topological con- 
dition, namely that W 2 (M) lifts to an integral class in H 2 (M, Z 2 ). Again, 
however, we cannot explain this here any further. We point out, however, 
that the required condition is satisfied for all oriented Riemannian manifolds 
of dimension 4. Thus, each oriented four-manifold possesses a spin 0 structure. 
Given a spin 0 structure, we may also consider the homomorphism 

Spin°(n) — > S 1 

obtained from (1.8.12) by projecting on the second factor. Identifying S 1 with 
U(l),we see that a spin 0 structure induces a set of transition functions for a 
vector bundle L with fiber C, a so called (complex) line bundle. 


Definition 1.8.10 The line bundle L is called the determinant line bundle 
of the spin 0 structure. 

As in the case of a spin structure, a spin 0 structure induces (half) spinor 
bundles S±, cf. Corollary 1.8.2. 

We return to the frame bundle P over M with fiber SO(n). SO(n) acts on 
Cl(R ra ) and Cl c (R n ) simply by extending the action of SO(n) on R”. Thus, 
P induces bundles 
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C1(P) = P X so(to) C1(R") 

Cl c (P) = P x SO („) Cl c (R") 

of Clifford algebras. 

Definition 1.8.11 The bundles C1(P) and Cl c (P) are called the Clifford 
bundles. 

Again, these Clifford bundles can be decomposed into bundles of elements 
of even and of odd degree. The chirality operator P (cf. Def. 1.8.3) is invariant 
under the action of SO(n),and it therefore defines a section of C1 C (P) of norm 

1. 

The definition of the Clifford bundles did not need a spin or spin c structure 
on M. But suppose now that we do have such a structure, a spin structure, 
say. Spin(n) acts on Cl c (R ra ) by conjugation. 

p{a)v = ava^ 1 for a € Spin(n), v € Cl c (]R n ) (1.8.23) 

(cf. Theorem 1.8.1 (note that a 4 = a -1 for a € Spin(n) by (1.8.9)) for the 
action of Spin(n) on R n , and extend this action to Cl c (R”); this is of course 
induced by the above action of SO(n) on Cl c (R")). This action commutes 
with the action of Spin(n) on Cl c (R") given by (1.8.23) and the action of 
Spin(n) on S n ; namely for a € Spin(n), v € Cl c (R"), s £ S n 

( ava~ 1 )(as ) = a(vs). (1.8.24) 

This compatibility with the Spin(n) actions ensures that we get a global 
action 

Cl c (R") x S„ S n (1.8.25) 

which is the above action by Clifford multiplication on each fiber. 

Perspectives. References for this § are [7], [166], [259], [18], [212], [187]. 


Exercises for Chapter 1 

1) Give five more examples of differentiable manifolds besides those dis- 
cussed in the text. 

2) Determine the tangent space of S n . (Give a concrete description of 
the tangent bundle of S n as a submanifold of S n x K" +1 .) 

3) Let M be a differentiable manifold, r : M — ■> M an involution without 
fixed points, i.e. r o r = id,r(a;) x for all x € M. We call points 
x and y in M equivalent if y = r( x). Show that the space M/t 
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of equivalence classes possesses a unique differentiable structure for 
which the projection M — > M/t is a local diffeomorphism. 

Discuss the example M = S n C K" +1 ,r(a;) = —x. M/t is real pro- 
jective space KP”. 


a: Let N be a differentiable manifold, / : M — > N a homeomor- 

phism. Introduce a structure of a differentiable manifold on M 
such that / becomes a diffeomorphism. Show that such a dif- 
ferentiable structure is unique. 

b: Can the boundary of a cube, i.e. the set {x G K”; max{|xj| : i = 

1, . . . , n} = 1} be equipped with a structure of a differentiable 
manifold? 

We equip R” +1 with the inner product 

(x, y) ■.= -x°y° + x 1 y 1 + . . . + x n y n 
for x = (x° jX 1 , . . . ,x n ),y = (y 0 ,^ 1 , . . . ,y n )- We put 

H n := {x £ K" +1 : (x,x) = — 1 ,Xq > 0}. 


Show that (•,•) induces a Riemannian metric on the tangent spaces 
T p H n C T p K" +1 for p £ H n . H n is called hyperbolic space. 

In the notations of 5), let 


s = (-i,o,..., o) g r +1 

2(x — s) 


/( x) '.= s — 


(X — s, x — S) 


Show that / : H n — > G K" : |£| < 1} is a diffeomorphism (here, 
K" = {(0, re 1 , . . . ,x n )} C K™ +1 ). Show that in this chart, the metric 


assumes the form 


4 

FfF 


de®de. 


Determine the geodesics of H n in the chart given in 6) (The geodesics 
through 0 are the easiest ones.). 

Hint for 5), 6), 7) : Consult §4.4. 

Determine the exponential map of the sphere S n , for example at the 
north polep. Write down normal coordinates. Compute the supremum 
of the radii of balls in T p S n on which exp p is injective. Where does 
exp p have maximal rank? 

Same as 8) for the flat torus generated by (1, 0) and (0, 1) G R 2 . 
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What is the transformation behavior of the Christoffel symbols under 
coordinate changes? Do they define a tensor? 

Let Co, ci : [0,1] — > M be smooth curves in a Riemannian manifold. 
If d(co(t), Ci(t)) < i(co(t)) for all t, there exists a smooth map c : 
[0,1] x [0,1] — > M with c(t, 0) = c 0 (£),c(t, 1) = Ci(t) for which the 
curves c(t, •) are geodesics for all t. 

Consider the surface S of revolution obtained by rotating the curve 
(x,y = e x ,z = 0) in the plane, i.e. the graph of the exponential 
function, about the x — ax in Euclidean 3-space, equiped with the 
induced Riemannian metric from that Euclidean space. Show that X 
is complete and compute its injectivity radius. 

Show that the structure group of the tangent bundle of an oriented 
d-dimensional Riemannian manifold can be reduced to SO(d). 

Can one define the normal bundle of a differentiable submanifold of a 
differentiable manifold in a meaningful manner without introducing 
a Riemannian metric? 

Let M be a differentiable submanifold of the Riemannian manifold 
N. M then receives an induced Riemannian metric, and this metric 
defines a distance function and a topology on M, as explained in §1.4. 
Show that this topology coincides with the topology on M that is 
induced from the topology of N. 

We consider the constant vector field X(x) = a for all x £ R n+1 . We 
obtain a vector field X(x) on S n by projecting X(x) onto T x S n for 
x £ S n . Determine the corresponding flow on S n . 

Let T be the flat torus generated by (1,0) and (0,1) € R 2 , with 
projection tt : R 2 — > T. For which vector fields X on R 2 can one 
define a vector field w*X on T in a meaningful way? Determine the 
flow of ir*X on T for a constant vector field X. 

Compute a formula for the Lie derivative (in the direction of a vector 
field) for a p-times contravariant and g-times covariant tensor. 

Show that for arbitrary vector fields X , Y. the Lie derivative satisfies 
Lx ° Ly — Ly O Lx = L[x,Y]- 


Prove Corollaries 4.2.3 and 4.2.4 below with the arguments used in 
the proofs of Theorem 1.4.5 and Corollary 1.4.2. 



2. De Rham Cohomology and Harmonic 
Differential Forms 


2.1 The Laplace Operator 

We need some preparations from linear algebra. Let V be a real vector space 
with a scalar product and let A P V be the p-folcl exterior product of V. 
We then obtain a scalar product on A P V by 

('Ci A ... A Up, iui A ... A w p ) = det((uj, Wj)) (2.1.1) 

and bilinear extension to A P (V). If ei, . . . , is an orthonormal basis of V, 

ej, A . . . A e.i p with 1 < i\ < 12 < ■ ■ ■ < ip < d (2.1.2) 

constitute an orthonormal basis of A P V. 

An orientation on V is obtained by distinguishing a basis of V as positive. 
Any other basis that is obtained from this basis by a base change with positive 
determinant then is likewise called positive, and the remaining bases are 
called negative. 

Let now V carry an orientation. We define the linear star operator 
* : A p {V) -> A d ~ p {V) (0 < p < d) 


by 

*( e ii A ... A ei p ) = ej 1 A ... A ej d _ p , (2.1.3) 

where j 1 , . . . , jd-p is selected such that e^, ... ,6^,6^, , ej d _ p is a positive 
basis of V. Since the star operator is supposed to be linear, it is determined 
by its values on some basis (2.1.3). 

In particular, 


*(l) = eiA...Ae d (2.1.4) 

*(e 1 A...Ae <I ) = l ) (2.1.5) 

if ei, . . . , ed is a positive basis. 

From the rules of multilinear algebra, it easily follows that if A is a d x d- 
matrix, and if / 1 , . . . , f p € V, then 


*(A/i A ... A Af p ) = (det A) * (f ± A ... A f p ). 
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In particular, this implies that the star operator does not depend on the 
choice of positive orthonormal basis in V, as any two such bases are related 
by a linear transformation with determinant 1. 

For a negative basis instead of a positive one, one gets a minus sign on 
the right hand sides of (2.1.3), (2.1.4), (2.1.5). 

Lemma 2.1.1 ** = (_i)p(<i-p) ; AP(V) -> A P (V). 


Proof. ** maps A p ( V) onto itself. Suppose 

*( e u A ... A e ip ) = e h A . . . A e jd _ p (cf. (2.1.3)). 


Then 


* * (ejj A ... A e ip ) = ±e il A . . . A e ip , 


depending on whether ej 1 , . . . , e.j d _ p , , . . . , e* is a positive or negative basis 

of V. Now 


A ... A e*. 


A ej 1 A ... A Cj d _ p 


= (-l) p(d p) e h A ... A e jd _ p A e ix A . . . A e ip , 

and (— l)p( d -p) thus is the determinant of the base change from , . . . , ej dp 
to ej 1 , . . . , e ip . □ 

Lemma 2.1.2 For v, w € A P (V) 

( v , w) = *(w A *u) = *(v A *u>). (2.1.6) 


Proof. It suffices to show (2.1.6) for elements of the basis (2.1.2). For any two 
different such basis vectors, w A *v = 0, whereas 

* {e iy A . . . A e ip A *(6^ A ... A e ip )) 

= *(ei A ... A e d ), where e \, . . . , e d 

is an orthonormal basis ((2.1.3)) 

= 1 ((2.1.5)), 

and the claim follows. □ 

Remark. We may consider (-, •) as a scalar product on 

A(V) ■.= ® A p (V) 

p = 0 

with A P (V) and A q {V) being orthogonal for p ^ q. 

Lemma 2.1.3 Let Vi, ... ,v d be an arbitrary positive basis of V. 
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Then 


*( 1 ) 


1 

sj det((i>j, Vj)) 


vi A ... A v d 


(2.1.7) 


Proof. Let e \, . . . , e d be a positive orthonormal basis as before. 

Then 

«i A . . . A Vd= (det((ni,tij)))5ei A . . . A e d , 
and the claim follows from (2.1.4). □ 


Let now M be an oriented Riemannian manifold of dimension d. Since M 
is oriented, we may select an orientation on all tangent spaces T X M , hence 
also on all cotangent spaces TfM in a consistent manner. We simply choose 
the Euclidean orthonormal basis g|r, . . . , of as being positive. Since 
all chart transitions of an oriented manifold have positive functional deter- 
minant, calling the basis . . . , d(/? _1 (g|j) of T X M positive, will not 

depend on the choice of the chart. 

Since M carries a Riemannian structure, we have a scalar product on each 
TfM. We thus obtain a star operator 

* : A p (TfM) -► A d ~ p (TfM), 
i.e. a base point preserving operator 

* : f2 p (M) -► Q d ~ p {M) (fF(M) = r(A p (M))). 

We recall that the metric on T*M is given by (g^(x)) = ( gij{x))~ 1 . There- 
fore, by Lemma 2.1.3 we have in local coordinates 


*(1) = \J~Aet{gij)dx 1 A ... A 


dx a 


( 2 . 1 . 8 ) 


This expression is called the volume form. 
In particular 


vol (M) ■- 


*( 1 ) 


M 


(2.1.9) 


(provided this is finite). 

For a,f3 £ f2 p (M) with compact support, we define the L 2 -product as 


(a,/3) : = J(a,P)* (1) 

M 

= J a A */3 by Lemma 2.1.2 . 

M 


This product on f2 p (M) is obviously bilinear and positive definite. 
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So far, we have considered only smooth sections of vector bundles, in 
particular only smooth p-forms. For later purposes, we shall also need L p - 
and Sobolev spaces of sections of vector bundles. For this aim, from now 
on, we deviate from Definition 1.5.3 and don’t require sections to be smooth 
anymore. We let E be a vector bundle over M, s : M — > E a section of E with 
compact support. We say that s is contained in the Sobolev space H k,r (E), 
if for any bundle atlas with the property that on compact sets all coordinate 
changes and all their derivatives are bounded (it is not difficult to obtain such 
an atlas, by making coordinate neighborhoods smaller if necessary), and for 
any bundle chart from such an atlas, 


ip : E w -► U x M" 

we have that iposm is contained in H k,r (U). We note the following consistency 
property: If <p% : E\ Vl — > U\ x M™, p 2 ■ E\ U2 ► U 2 x K" are two such bundle 
charts, then <pios|j/ in (/ 2 is contained in H k ’ r (UinU 2 ) if and only if <P2 0 S\u 1 nu 2 
is contained in this space. The reason is that the coordinate change p 2 o pf 1 
is of class C°° , and all derivatives are bounded on the support of s which was 
assumed to be compact. 

We can extend our product (•,•) to L 2 (fi p (M)). It remains bilinear, and 
also positive definite, because as usual, in the definition of L 2 , functions that 
differ only on a set of measure zero are identified. 

We now make the assumption that M is compact, in order not to always 
have to restrict our considerations to compactly supported forms. 

Definition 2.1.1 d* is the operator which is (formally) adjoint to d on 
® i? p (M) w.r.t. (•,•)■ This means that for a € i7 p_1 (M),/3 G f2 p (M) 

p = 0 

(da,/3) = (a,<T/3); (2.1.10) 

d* therefore maps fl p (M) to fI p ~ 1 {M). 

Lemma 2.1.4 d* : f2 p (M) — > fP ~ 1 ( M ) satisfies 

d* = (-l) d{p+1)+1 *d* . (2.1.11) 


Proof. For a G S2 P ~ 1 (M),(3 G Q P {M) 

d{a A */3) = da A */3 + (— l) p_1 a Ad* (3 

= da A */3 + (-lf-^-l )(P- 1 )( d -P+ 1 ) a A **(d*p) 
by Lemma 2.1.1 (d* f3 is a (d — p + l)-form) 

= da A */3 - (~l) d(p+1)+1 a A* * d* (3 
= ± * ((da, (3) - (-l) d(p+1)+1 (a, *d * (3)). 
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We integrate this formula. By Stokes’ theorem, the integral of the left hand 
side vanishes, and the claim results. □ 

Definition 2.1.2 The Laplace (-Beltrami) operator on fi p (M) is 
A = dd* + d*d : f2 p (M) -* f2 p (M). 
ui G f] p (M) is called harmonic if 


Au> = 0 . 


Remark. Since two stars appear on the right hand side of (2.1.11), d* and 
hence also A may also be defined by (2.1.11) on nonorientable Riemannian 
manifolds. We just define it locally, hence globally up to a choice of sign 
which then cancels in (2.1.11). Similarly, the L 2 -product can be defined on 
nonorientable Riemannian manifolds, because the ambiguity of sign of the * 
involved cancels with the one coming from the integration. 

More precisely, one should write 

d p : O p (M) -» O p+1 (M) 
d* : Q P (M) -> n p -\M). 

Then 

A p = d p - x d* + d*d p : L2 P (M) -> L1 P (M). 

Nevertheless, we shall usually omit the index p. 

Corollary 2.1.1 A is (formally) selfadjoint, i.e. 

(Aa, 0) = (a, A0) for a, 0 e f2 p (M). 

Proof. Directly from the definition of A. □ 

Lemma 2.1.5 Aa = 0 da = 0 and d*a = 0. 

Proof. 

“ <*= ” : obvious 

“ => ” : (Aa, a) = ( dd*a , a) + ( d*da , a) = ( d*a , d*a) + (da, da). 

Since both terms on the right hand side are nonnegative and vanish only 
if da = 0 = d*a , Aa = 0 implies da = 0 = d*a. □ 

Corollary 2.1.2 On a compact Riemannian manifold, every harmonic func- 
tion is constant. □ 


Lemma 2.1.6 *A = A* . 
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Proof. Direct computation. □ 


We want to compare the Laplace operator as defined here with the stan- 
dard one on R d . For this purpose, let / : — > M be a differentiable function. 

We have 

,, 9f i 

d, = a? ix 

d 

and for = ipidx 1 with compact support, and = E (— l) 2 1 (fidx 1 A ... A 

i= 1 

dx i A ... A dx d 


( df , p) = J ^.ipidx 1 A ... A dx d 


f dip 1 

= — / f-g—dx 1 A ... A dx d , since p is compactly supported. 


It follows that d*p = — = — div p, and 

d 02 * 

Af = d*df = -Y J = - div (grad /). 

This Laplace operator therefore differs from the usual one on by a minus 
sign. This is regrettable, but cannot be changed any more since the notation 
has been established too thoroughly. With our definition above, A is a positive 
operator. 

More generally, for a differentiable function / : M — > R 

with g := det(<?y). This is seen as follows: 

Since for functions, i.e. 0-forms, we have d* = 0, we get for p : M — > R. 
(differentiable with compact support) 

J Af ■ ipy/gdx 1 A ... A dx d = (Af, p) = (df, dp) 

= J ( df , dp) * (1) 

= [ g^^l^r^gdx 1 ...dx d 

J dx 1 dx 1 v 

j G^l?) v ^ sii1 ■ ■ ■ dx "• 


and since this holds for all p £ Cq° ( M, R) , (2.1.12) follows. 
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For a function /, we may define its gradient as 

7/:=gr»d/:= 9 «^A. (2.1.13) 

We thus have for any vector field X 

(grad f,X) = X(f) = df(X). (2.1.14) 

The divergence of a vector field Z = Z l ,'Z is defined as 

di ^ : = (2 ' L15) 

(2.1.12) then becomes 

Af = — div grad /. (2.1.16) 

In particular, if M is compact, and / : M — > R is a smooth function, then 
as a consequence of (2.1.16) and (2.1.15) or (2.1.12) and the Gauss theorem, 
we have 

/Af*(D=0. (2.1.17) 

M 

We now want to compute the Euclidean Laplace operator for p- forms. It 
is denoted by A e \ likewise, the star operator w.r.t. the Euclidean metric is 
denoted by * e , and d* is the operator adjoint to d w.r.t. the Euclidean scalar 
product. 

Let now 

to = 0 Ji 1 ...i p dx H A ... A dx lp 

be a p-form on an open subset of as usual with an increasing p- 
tuple 1 < ii < i 2 <■■■< i p < d. We choose ji : ... . j,i- P such that 
gJG, . . . , gjjri • ■ • , 9x ? d - p is a positive orthonormal basis of In the 
sequel always 

l G {1 ,...,p},k G {1 ,...,d-p}. 

Now 


d—p 


duj = Yl 


dw. 


fe = i 


" 1 : %p dx^ k A dx 11 A ... A dx lp 
dxi* 


d—p 


dbj. 


du = ^(-i)P+fc- 1 £\* p dx ? 1 A ... A dxi «- A ... A dx jd ~ p (2.1.18) 


fc= l 
d—p 


d* e du> = Y^(- i) p+fe 


k=l 


_i 9 

(chb'fc ) 2 


dx^ k A da;- 51 A ... A dx^ k A ... A dx ld ~ p 


i—p p 


+ Y Y(-l) p+k ~ \ y lp dx H Adx J1 A . . . Adx^ A ... A dx**-’ (2.1.19) 
Z^Z^^ > dx^dx'i v ’ 


fc= l £=i 
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d 2 o 


* e d* e diu = y (-1 )P+P( d -P) dx* 1 A ... A dx* p 


fc= l 


d—p p 


+EE(~ 1 ) pd+ ' 

fc= i e=i 

Hence with (2.1.11) 


-ip 

dx^ k dx ie 


dx Jfc A dx* 1 A ... A dx** A ... A dx* p . (2.1.20) 


d 2 to 

d*dio = ^(-1) ^T^ dx* 1 A ... A dx* p 
k=l 
d—p p 


(dx^y 


Q 2 

+ V V (-l / +1 — yy^dx Jk A dx* 1 A . . .Adx** A. . .A dx* p . (2.1.21) 
^ ’ dx^dx^ v ' 


fc=i ^=1 

Analogously 


* e u = Wj 1 ...i p dx- 71 A ... A dx Jd ~ p 


P Q^. 

d* e u> = y — yy dx ,f A dx n A ... A dx ld ~ p 


dx l e 


(2.1.22) 

(2.1.23) 


<9ua 


* e d * e w = y(-i)p( d -p)+ d -p+^~ i yy 
' <9x** 


<=i 


dx* 1 A ... A dx** A ... A dx* p 
p «2, . 

d,,d,,u, = ’Z(- 1 ) r<d - ,)+, ‘- r+, - 1 ls ^y 


(2.1.24) 


dx** A dx n A ... A dx** A ... A dx* p (2.1.25) 


p d—p 


+ y yf-iy-PH^-P+l- 1 d.r 11 A dx* 1 A ... A dx« A ... A dx* p , 

> dx l *dx3* 

e=i fc= i 

hence with (2.1.24) 


P n2 

dd*w = y(-l) rn n y r , p dx l1 A ... A dx lp 
e=i 


(dx^y 


(2.1.26) 


p d—p 


+EE(- 1 ) 


> 3 2 w,;, 


£=i fc=i 

(2.1.21) and (2.1.26) yield 


<9x** 3x J ’ fe 


-dx Jk A dx 11 A ... A dx** A ... A dx* p 


Z\ e w = d*cLo + dd*u) = 


(-DE 


(dx m ) 2 


dx* 1 A ... A dx* p . 


(2.1.27) 
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Some more formulae: 
We write 


V '■= \fgdx 1 A ... A dx d =: r)i 1 ...i i dx 11 A ... A dx Zd . 
For (3 = ft j 1 ...j p dx : ' 1 A ... A dx^ p 

P 1 -** ■= 9 nh 9 i2j2 ■■■9 ipjp Pn... Jp - 
With these conventions, for a = A ... A dx lp 


{*a)i p+1 ...i d = -jVii...i P <x n - tp 


and 


Further 




Mfd&ki-L.-.ip-! _ pj a 


dx e 




■ip — i ) ' 


(a,0) = d 1 ' 

(<*»•«) = da g x P 9> AV‘" .“!>«- 


(2.1.28) 

(2.1.29) 

(2.1.30) 

(2.1.31) 

(2.1.32) 

(2.1.33) 


(d*a,d*p) = (g u ( 

r( 


dx e 

9Pmj 1 ...j p _ 1 


- r[ 


eq A. . .Ae, 


cte" 


- n 


%@rj 1 ...j p - 1 'jej 1 A- • • Ae jp _J (2.1.34) 


dcXki 1 ...i p - 1 dPmji...j p -i 

dx e dx n 

daki 1 ...i p - 1 


g U g mn g i «i . . . gip-ljp-1 


dx e 

dpmji...j p -i 

dx n 


pi R ■ ■ n kC n ip-ijp - 1 

- 1 mnl J iJl--3 P -l9 ■■■y 


r j n a kl a mn a iljl Jp-dp-i 

1 1 / 9 ■■■9 


Formula (2.1.30) is clear. (2.1.31) may be verified by a straightforward, but 
somewhat lengthy computation. We shall see a different proof in 3.3 as a 
consequence of Lemma 3.3.4. The remaining formulae then are clear again. 


2.2 Representing Cohomology Classes 
by Harmonic Forms 

We first recall the definition of the cle Rham cohomology groups. Let M 
be a differentiable manifold. The operator d : ft p (M) — > ft P+1 (M) satisfies 
(Theorem 1.5.5) 

dod = 0 (dod : ft p (M) -> ft p+2 (M)). (2.2.1) 
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a € f2 p (M) is called closed if da = 0, exact, if there exists rj € Q p ~ 1 ( M ) 
with dr] = a. Because of (2.2.1), exact forms are always closed. Two closed 
forms a, (3 £ f2 p (M) are called cohomologous if a — (3 is exact. This property 
determines an equivalence relation on the space of closed forms in f2 p (M), 
and the set of equivalence classes is a vector space over R, called the p-tli de 
Rham cohomology group and denoted by 

H P dR (M,R). 

Usually, however, we shall simply write 

H p (M). 

In this Paragraph, we want to show the following fundamental result: 

Theorem 2.2.1 Let M be a compact Riemannian manifold. Then every co- 
homology class in H P (M) (0 < p < d = dimM) contains precisely one 
harmonic form. 

Proof. Uniqueness is easy: Let wi,w 2 G f2 p (M) be cohomologous and both 
harmonic. Then either p = 0 (in which case u>i = w 2 anyway) or 

(wi - i02 , - w 2 ) = (^1 - to 2, drj) 

for some?? € J? P_1 (M), since 
oj\ and u >2 are cohomologous 
= (d* (uq - w 2 ),??) 

= 0, since wi andw 2 are harmonic, 
hence satisfy d*u>\ = 0 = d*ui 2 ■ 

Since (•, •) is positive definite, we conclude u>\ = w 2 , hence uniqueness. 

For the proof of existence, which is much harder, we shall use Dirichlet’s 
principle. 

Let lo o be a (closed) differential form, representing the given cohomology 
class in H P (M). 

All forms cohomologous to u>q then are of the form 
u> = w 0 + da (at f2 p ~ 1 (M)). 

We now minimize the L 2 -norm 

D(pS) := (lo,lo) 


in the class of all such forms. 

The essential step consists in showing that the infimum is achieved by a 
smooth form r]. Such an rj then has to satisfy the Euler-Lagrange equations 
for D , i.e. 
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0 = -^(77 + td/3, r] + td/3)\ t =o for all/3 € 1 (M) 

= 2 ( 77 , d/3). (2.2.2) 

This implies 5rj = 0. Since dp = 0 anyway, 77 is harmonic. 

In order to make Dirichlet’s principle precise, we shall need some re- 
sults and constructions from the calculus of variations. Some of them will 
be merely sketched (see §A.l, A. 2 ), and for details, we refer to our textbook 
“Postmodern Analysis”, Springer, 3rd edition, 2005. First of all, we have to 
work with the space of L 2 - for ms instead of the one of C°°-forms, since we 
want to minimize the L 2 -norm and therefore certainly need a space that is 
complete w.r.t. L 2 -convergence. For technical purposes, we shall also need 
Sobolev spaces which we now want to define in the present context (See also 
§A.l of the Appendix). 

On f2 p (M), we introduce a new scalar product 

((w, u))) := ( duj , duS) + (6 cj, Suj) + (w, u>) 

and put 

||w||ff 1 >2(M) := ((w,w)) s . 

We complete the space Q P {M) of smooth p-forms w.r.t. the || • \\h 1 ’ 2 (m)- 
norm. The resulting Hilbert space will be denoted by or simply by 

H 1 ’ 2 (M), if the index p is clear from the context. 

Let now V C be open. For a smooth map / : V — > R", the Euclidean 
Sobolev norm is given by 

\v v ) 

the dot • denoting the Euclidean scalar product. 

With the help of charts for M and bundle charts for A P (AI) for every 
xq € Ad, there exist an open neighborhood U and a diffeomorphism 

p : A p (M)\u V x I n 

where V is open in R d , n = (^) is the dimension of the fibers of A p (Ad), and 
the fiber over x £ U is mapped to a fiber { 7 r(<p(a;))} x R ra , where 7r : V x R n — > 
V is the projection onto the first factor. 

Lemma 2.2.1 On any U' <s U , the norms 

and\\ip{u>)\\ H i ,2 (y/) 

eucl. 

(with V' := ir(<p(U'))) are equivalent. 
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Proof. As long as we restrict ourselves to relatively compact subsets of U, 
all coordinate changes lead to equivalent norms. Furthermore, by a covering 
argument, it suffices to find for every x in the closure of U' a neighborhood 
U" on which the claimed equivalence of norms holds. 

After these remarks, we may assume that first of all 7ro tp is the map onto 
normal coordinates with center Xo, and that secondly for the metric in our 
neighborhood of Xq . we have 

\ 9 ij{x) - Sij | < e and \r l jk (x)\ < eior i,j, k = l,...,d (2.2.3) 

The formulae (2.1.32) - (2.1.34) then imply that the claim holds for suffi- 
ciently small e > 0, i.e. for a sufficiently small neighborhood of Xq. Since 
U' C U is compact by assumption, the claim for U' follows by a covering 
argument. □ 

Lemma 2.2.1 implies that the Sobolev spaces defined by the norms 

|| • ||_f/i, 2 ( M ) and || • || ff i ,2 coincide. Hence all results for Sobolev spaces in 
eucl. 

the Euclidean setting may be carried over to the Riemannian situation. In 
particular, we have Rellich’s theorem (cf. Theorem A. 1.8): 

Lemma 2.2.2 Let (w n )neN C HL 2 (M) be bounded, i.e. 

W^nWm^iM) < K. 

Then a subsequence of (u> n ) converges w.r.t. the L 2 -norm 

IM| l2 (M ) := 

to some lv € H^ 2 (M). □ 

Corollary 2.2.1 There exists a constant c, depending only on the Rieman- 
nian metric of M, with the property that for all closed forms [3 that are 
orthogonal to the kernel of d*, 

((3,(3) <c(d*(3,d*(3) (2.2.4) 


Proof. Otherwise, there would exist a sequence of closed forms (3 n orthogonal 
to the kernel of d* , with 


((3 n ,(3 n ) > n(d* (3 n ,d* (3 n ) 


We put 


Then 


n • ((3m (3 n) 2 • 

1 = (\ n (3nAn(3n) > Tl(d* (\ n (3 n ) , d* (\ n (3 n )) . 


Since d(3 n — 0, we have 


(2.2.5) 


( 2 . 2 . 6 ) 
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W^uPtiWh 1 ’ 2 < 1 h — • 
n 

By Lemma 2.2.2, after selection of a subsequence, A n /3 n converges in L 2 to 
some form ip. By (2.2.6), d*(X n /3 n ) converges to 0 in L 2 . Hence d*ip = 0; this 
is seen as follows: 

For all <p 


0 = lim (d*(\ n /3 n ),ip) = lim(A„/3„, dip) 

n— »■ oo 

= (ip, dip) = ( d*ip , i p) and hence d*ip = 0. 

(With the same argument, d(3 n = 0 for all n implies dip = 0.) 

Now, since d*ip = 0 and (3 n is orthogonal to the kernel of d * , 

(Vh KPn) = 0. (2.2.7) 

On the other hand, (\ n f3 n , X n /3 n ) = 1 and the L 2 -convergence of A n /3 n to ip 
imply 

lim ( ip,X n /3 n ) = 1. 

n— »oo 

This is a contradiction, and (2.2.5) is impossible. □ 

We can now complete the proof of Theorem 2.2.1: 

Let (<x> n ) n gN be a minimizing sequence for D(uS) in the given cohomology 
class, i.e. 


— to o + da n 

D(u> n ) -► inf D(w) =: k (2.2.8) 

uj=ujQ-\-da 

By (2.2.8), w.l.o.g. 

(lVn,COn) = D(ui n ) < k + 1. (2.2.9) 

As with Dirichlet’s principle in R d , uj n converges weakly to some iv, after 
selection of a subsequence. 

We have 

(w — wo, P>) = Ofor all ip £ Q P (M) with d* ip = 0, (2.2.10) 

because 


(u> n — wo, ip) = ( da n , i p) = ( a n , d*ip) = 0 for all such p. 

(2.2.10) means that w — wo is weakly exact. 

We want to study this condition more closely and put 

77 := w — wo- 

We define a linear functional on d*(fl p (M)) by 
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£(5<p) := (77, <p) 


( 2 . 2 . 11 ) 


t is well defined; namely if d*ipi = d*(f 2 , then 

(»?, <Pi - <^2) = 0 by (2.2.10). 

For ip £ f] p (M) let n ((p) be the orthogonal projection onto the kernel of d*, 
and xp := <p — 7r(y>); in particular d*ip = d*tp. 

Then 

£(d*<p) = £(d*ip) = (77, ip) (2.2.12) 

Since ip is orthogonal to the kernel of S , by Corollary 2.2.1 

IIV’IU 2 < c\\d*ip\\ L 2 = c\\d*ip\\ L 2 (2.2.13) 

(2.2.12) and (2.2.13) imply 

\£(d*p)\ < c||?7|| L 2||dV|| i 2. 

Therefore, the function £ on d*(f2 p (M)) is bounded and can be extended 
to the L 2 -closure of d*(f2 p (M)). By the Riesz representation theorem, any 
bounded linear functional on a Hilbert space is representable as the scalar 
product with an element of the space itself. Consequently, there exists a with 

(a,<fip) = (ri,<p) (2.2.14) 


for all ip £ f2 p (M). 

Thus, we have weakly 

da = 77. (2.2.15) 

Therefore, u> = ojq + 77 is contained in the closure of the considered class. 
Instead of minimizing among the to cohomologous to u>o, we could have min- 
imized as well in the closure of this class, i.e., in the space of all u> for which 
there exists some a with 


(a, d* <p) = (w — wo, y>) for all<p £ 

Then w, as weak limit of a minimizing sequence, is contained in this class. 
Namely, suppose LO n = ojq + da n weakly, i.e. 

£ n {d*ip) := (a n , d* <p) = (w„ - w 0 , <p) V p £ H P {M). 

By the same estimate as above, the linear functionals £ n converge to some 
functional t, again represented by some a. Since D also is weakly lower semi- 
continuous w.r.t. weak convergence, it follows that 

k < D(oj) < lim inf D(lo u ) = k, 

n— >00 

hence 

D(u>) = K. 
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Furthermore, by (2.2.2), 

0 = (w,d/3) for all/3 G (2.2.16) 


In this sense, u> is weakly harmonic. 

We still need the regularity theorem implying that solutions of (2.2.16) 
are smooth. This can be carried out as in the Euclidean case. If one would be 
allowed to insert /3 = d*cu in (2.2.16) and integrate by parts, it would follow 
that 

0 = (d*u>,d*uj), 


i.e. d*u> = 0. 

Iteratively, also higher derivatives would vanish, and the Sobolev embed- 
ding theorem would imply regularity. However, we cannot yet insert (3 = d*u, 
since we do not know yet whether dd*ui exists. This difficulty, however, may 
be overcome as usual by replacing derivatives by difference quotients (See 
§A.2 of the Appendix.). In this manner, one obtains regularity and com- 
pletes the proof. □ 


Corollary 2.2.2 Let M be a compact, oriented, differentiable manifold. Then 
all cohomology groups H dR (M,M.) (0 < p < d := dim M) are finite dimen- 
sional. 


Proof. By Theorem 1.4.1, a Riemannian metric may be introduced on M. By 
Theorem 2.2.1 any cohomology class may be represented by a form which 
is harmonic w.r.t. this metric. We now assume that H P {M) is infinite di- 
mensional. Then, there exists an orthonormal sequence of harmonic forms 
(i?n)ne n C H p (M), i.e. 

{Vn, Vm) = 5nm for 71, TO G N. (2.2.17) 

Since the r) n are harmonic, d*r] n = 0, and dr] n = 0. By Rellich’s theorem 
(Lemma 2.2.2), after selection of a subsequence, (?/„) converges in L 2 to some 
p. This, however, is not compatible with (2.2.17), because (2.2.17) implies 

\\rjn ~ r) m \\ L 2 > 1 forn ^ to, 

so that (?/„) cannot be a Cauchy sequence in L 2 . 

This contradiction proves the finite dimensionality. □ 


Let now M be a compact, oriented, differentiable manifold of dimension 
d. We define a bilinear map 

H p dR (M,R) x H d d ~ p (M,R) 


by 


(^, 77 ) 


I — > 


/ 


LU A 7] 


(2.2.18) 
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for representatives 07,77 of the cohomology classes considered. It remains to 
show that (2.2.18) depends only on the cohomology classes of lo and 77, in 
order that the map is indeed defined on the cohomology groups. If, however, 
lo' and lo are cohomologous, there exists a (p — 1) form a with to' = lo + da, 
and 

/ lo' A 77 = f (lo + da) A 77 = J lo A 77 + / da A 77 

MM MM 

= / lo A 77 + / d(af\rf), 

M M 

since 77 is closed representing a cohomology 

class by Stokes’ theorem 
= f lo A T] . 

M 

Therefore, (2.2.18) indeed depends only on the cohomology class of lo, and 
likewise only on the cohomology class of 77. 

Let us now recall a simple result of linear algebra. Let V and W be finite 
dimensional real vector spaces, and let 


(•,•) : Vx IT-^R. 

be bilinear and nondegenerate in the sense that for any v G V,v ^ 0 , there 
exists w G W with (v,w) ^ 0, and conversely. Then V can be identified with 
the dual space W* of W, and W may be identified with V*. Namely, 

i\ : V — > W* with *i(u)(tzj) := (v,w) 
i 2 : IT — > V* with i2(w)(v) := (v,w) 

are two injective linear maps. Then V and W must be of the same dimension, 
and ?’i and are isomorphisms. 

Theorem 2.2.2 Let M be a compact, oriented, differentiable manifold of di- 
mension d. The bilinear form ( 2 . 2 . 18 ) is nondegenerate, and hence H^ R (M , R) 
is isomorphic to (H^ R P (M, R))*. 

Proof. For each nontrivial cohomology class in H P (M), represented by some 
lo (i.e. dio = 0, but not lo = da for any (p— l)-form a), we have to find some 
cohomology class in H d ~ p (M) represented by some 77, such that 

J lo A 77 ^ 0. 

M 

For this purpose, we introduce a Riemannian metric on M which is possible 
by Theorem 1.4.1. By Theorem 2.2.1, we may assume that lo is harmonic 
(w.r.t. this metric). By Lemma 2.1.6 


A * lo = *Aio, 
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and therefore, *lo is harmonic together with ui. Now 

J wAw=(w,w)/0, since ui does not vanish identically. 

M 

Therefore, *oj represents a cohomology class in H d ~ p (M) with the desired 
property. Thus the bilinear form is nondegenerate, and the claim follows. □ 

Definition 2.2.1 The p-th homology group of a compact, differ- 
entiable manifold M is defined to be M))*. The p-th Betti number 

of M is b p (M) := dim H P (M,R). 

With this definition, Theorem 2.2.2 becomes 

H P (M,R) SS H*~ P (M,R). (2.2.19) 

This statement is called Poincare duality. 

Corollary 2.2.2 Let M be a compact, oriented, differentiable manifold of 
dimension d. Then 

Hd R (M,R) = R. (2.2.20) 

and 

b p (M) = b d - p (M) for 0 < p < d. (2.2.21) 


Proof. H'ff ( M, R) = K. This follows e.g. from Corollary 2.1.2 and Theorem 
2.2.1, but can also be seen in an elementary fashion. 

Theorem 2.2.2 then implies (2.2.20), as well as (2.2.21). □ 


As an example, let us consider an n-dimensional torus T n . As shown 
in 1.4, it can be equipped with a Euclidean metric for which the covering 
7 r : K n — > T n is a local isometry. 

By (2.1.27), we have for the Laplace operator of the Euclidean metric 


A(u 


A ... A dx ip ) = (— 1) 


d 2 L0i 


m — 1 


( dx m ) 


ffdx 11 A ... A dx 


(a; 1 , . . . ,x n Euclidean coordinates of K"). 


Thus, a p-form is harmonic if and only if all coefficients w.r.t. the basis 
dx 11 A ... A dx lp are harmonic. Since T n is compact, by Corollary 2.1.2, they 
then have to be constant. Consequently 

b p {T n ) = dim lP(T n ) = dim A p (K n ) = Q (0 < p < n). 
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2.3 Generalizations 

The constructions of this chapter may easily be generalized. Here, we only 
want to indicate some such generalizations. 

Let E and F be vector bundles over the compact, oriented, differentiable 
manifold M. Let r(E) and r(F) be the spaces of differentiable sections. 
Sobolev spaces of sections can be defined with the help of bundle charts: 
Let (/,[/) be a bundle chart for E, f then identifies E\u with U x R n . A 
section s of E is then contained in the Sobolev space H k ’ p (E) if for any such 
bundle chart and any U' <s U, we have P 2 ° / ° s\jji £ H k ’ p (U', ]R n ), where 
p 2 : U' x R" — > ]R" is the projection onto the second factor. 

A linear map L : r(E ) — > r(F) is called (linear) differential operator of 
order t from A to A if in any bundle chart, L defines such an operator. For 
the Laplace operator, of course E = F = A P (T*M), £ = 2. 

In a bundle chart, we write L as 

L = P e (D) + ... + P 0 (D), 

where each Pj(D) is an (m x n)-matrix (■ m,n = fiber dimensions of E and 
F, resp.), whose components are differential operators of the form 

y, a a (x)D a 

M =i 

where a is a multi index, and D a is a homogeneous differential operator of 
degree |a| = j. Let us assume that the a a (x) are differentiable. 

For £ = (^, . . . ,£ m ) e let Pj(£) be the matrix obtained for Pj{D) 
by replacing D a by £ Q . 

Pj(0 thus has components 

a a {x)C- 

l«l=j 

L is called elliptic at the point x, if P^(£) (£ = degree of L ) is nonsingular at 
x for all £ £ R m \{0}. Note that in this case necessarily n = m. 

L is called elliptic if it is elliptic at every point. Let now W)e and (•, -)f 
be bundle metrics on E and F, resp. (those always exist by Theorem 1.5.3), let 
M carry a Riemannian metric (existing by Theorem 1.4.1) and an orientation. 
Integrating the bundle metrics, for example 

(•, • )e := J (*, • )Edvo\g (ckvoig = yjdet (gij)dx 1 A ... A dx d ), 

M 

we obtain L 2 -metrics on r(E) and r(F). Let L* be the operator formally 
adjoint to L, i.e. 


( Lv,w)f = ( v,L*w)e foru £ r(E),w £ F(F). 
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L is elliptic if L* is. 

The importance of the ellipticity condition rests on the fact that solutions 
of elliptic differential equations are regular, and the space of solutions has 
finite dimension. 

Here, however, this shall not be pursued any further. 


Exercises for Chapter 2 

1) Compute the Laplace operator of S n on p-forms (0 < p < n) in the 
coordinates given in § 1 . 1 . 

2) Let lo € 17 1 (S' 2 ) be a 1-form on S 2 . Suppose 

p* u = to 

for all p £ SO (3). Show that ui = 0. 

Formulate and prove a general result for invariant differential forms 
on S n . 

3) Give a detailed proof of the formula 

*Z\ = A * . 

4) Let M be a two dimensional Riemannian manifold. Let the metric be 
given by gtj(x)dx l (g> dx J in local coordinates (a; 1 , a; 2 ). Compute the 
Laplace operator on 1-forms in these coordinates. Discuss the case 
where 

9 ij{x) = A 2 (a :)6ij 
with a positive function A 2 (a:). 

5) Suppose that a € H^ 2 {M) satisfies 

( d*a , d*<p) + (da, dip) = ( 77 , p) for all p G f] p (M), 
with some given rj £ f2 p (M). Show a £ Q P (M), i.e. smoothness of a. 

6 ) Compute a relation between the Laplace operators on functions on 
]R" +1 and the one on S n C R" +1 . 

7) Eigenvalues of the Laplace operator: 

Let M be a compact oriented Riemannian manifold, and let A be the 
Laplace operator on f2 p (M). A £ R is called eigenvalue if there exists 
some u £ f2 p (M),u ^ 0, with 


Au = Xu. 
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Such a u is called eigenform or eigenvector corresponding to A. The 
vector space spanned by the eigenforms for A is denoted by V\ and 
called eigenspace for A. 

Show: 

a: All eigenvalues of A are nonnegative, 

b: All eigenspaces are finite dimensional, 

c: The eigenvalues have no finite accumulation point, 

d: Eigenvectors for different eigenvalues are orthogonal. 

The next results need a little more analysis (cf. e.g. [139]) 
e: There exist infinitely many eigenvalues 

Ai A A2 ^ ... ^ A„ < . . . 

f: All eigenvectors of A are smooth. 

g: The eigenvectors of A constitute an L 2 -orthonormal basis for 

the space of p - forms of class L 2 . 

8) Here is another long exercise: 

Let M be a compact oriented Riemannian manifold with boundary 
d M 7! 0. For x £ dM, V £T X M is called tangential if it is contained 
in T x dM C T X M and W £ T X M is called normal if 

(V, W) = 0 for all tangential V. 

An arbitrary Z £ T X M can then be decomposed into a tangential and 
a normal component: 


Z — -^tan + Z nor- 

Analogously, 77 £ r p (T x ,M) can be decomposed into 

V = Tytan + Vnor 

where 77 tan operates on tangential p - vectors and ?7 nor on normal ones. 
For p-forms lu on M, we may impose the so-called absolute boundary 
conditions 

w tan = 0 


((5u;)nor — 0 


on dM 
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or the relative boundary conditions 

^nor = 0 

(dw) n or = 0. 


on d M 


(These two boundary conditions are interchanged by the *-operator.) 
Develop a Hodge theory under either set of boundary conditions. 



3. Parallel Transport, Connections, and 
Co variant Derivatives 


3.1 Connections in Vector Bundles 

Let X be a vector field on f 1 , F a vector at Xq £ R d . We want to analyze 
how one takes the derivative of X at xq in the direction V. For this derivative, 
one forms 

r X(x 0 + tV)~X(x 0 ) 

iim . 

t— >o t 

Thus, one first adds the vector tV to the point Xq. Next, one compares the 
vector X{xq + tV) at the point Xq + tV and the vector X(xo) at Xo; more 
precisely, one subtracts the second vector from the first one. Division by t 
and taking the limit then are obvious steps. 

A vector field on R d is a section of the tangent bundle T(R d ). Thus, 
X(xq + tV) lies in Ta; 0+t y(K d ), while V(a.’o) lies in T Xo (R d ). The two vectors 
are contained in different spaces, and in order to subtract the second one from 
the first one, one needs to identify these spaces. In R d , this is easy. Namely, 
for each x £ R d , T x R d can be canonically identified with ToR d = R d . For 
this, one uses Euclidean coordinates and identifies the tangent vector .((, at 
x with at 0. This identification is even expressed by the notation. The 
reason why it is canonical is simply that the Euclidean coordinates of R d 
can be obtained in a geometric manner. For this, let c(t) = tx,t £ [0,1] the 
straight line joining 0 and x. For a vector X\ at x , let X t be the vector at 
c(t) parallel to X- t : in particular, X t has the same length as X\ and forms the 
same angle with c. X 0 then is the vector at 0 that gets identified with X\. 
The advantage of the preceding geometric description lies in the fact that X\ 
and Xo are connected through a continuous geometric process. Again, this 
process in ]R d has to be considered as canonical. 

On a manifold, in general there is no canonical method anymore for iden- 
tifying tangent spaces at different points, or, more generally fibers of a vector 
bundle at different points. For example, on a general manifold, we don’t have 
canonical coordinates. Thus, we have to expect that a notion of derivative 
for sections of a vector bundle, for example for vector fields, has to depend 
on certain choices. 

Definition 3.1.1 Let M be a differentiable manifold, E a vector bundle over 
M. A covariant derivative , or equivalently, a (linear) connection is a map 
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D : r(E) -► r(E) 0 r(T*M) 

with the properties subsequently listed: 

By property (i) below, we may also consider D as a map from 
r{TM) 0 r(E) to r{E) and write for a G r(E),V G T X M 

Dc r(V) =: D v a. 


We then require: 

(i) D is tensorial in V : 

Dv+w& = Dyer + Dw& for V,W £ T X M , a G r(E) (3.1.1) 

D fv a = fD v c r for/ G C°°(M,M.),V G T{TM). (3.1.2) 

(ii) D is M-linear in cr : 

Dy(a + t) = Dyo + Dyr for V G T X M, <j,t G r(E) (3.1.3) 
and it satisfies the following product rule: 

Dy(fa) = V(f)-a + fD v a for / G C°° (M, R) . (3.1.4) 


Of course, all these properties are satisfied for the differentiation of a 
vector field in as described; in that case, we have DyX = dX(V). 

Let £0 G M, and let U be an open neighborhood of xq such that a chart 
for M and a bundle chart for E are defined on U. We thus obtain coordinate 
vector fields t// , . . . , , and through the identification 

E\jj = U x R n (n = fiber dimension of If), 


a basis of yields a basis /z 1 , . . . , /x„ of sections oiE\u. For a connection D, 
we define the so-called Christoff el symbols (/, k = 1, . . . , n, i = 1, . . . , d) 

by 



(3.1.5) 


We shall see below that the Christoffel symbols as defined here are a gener- 
alization of those introduced in 1.4. 

Let now n G r(E ); locally, we write n(y) = a k (y)y,k(y )• Also let c(t) be a 
smooth curve in U. Putting fi(t) := n(c(t)), we define a section of E along c. 
Furthermore, let V(t) = c(t)(:= ^ c(t )) = 


Then by (3.1.1) - (3.1.5) 


Dy( t )fJi(i) = a k (t)yk{c{t)) + c\t)a k (t)D_a^fj, k 

dx x 

= h k (t)n k (c{t)) + c\t)a k {t)r 3 ik {c{t))^{c{t)). 


(3.1.6) 
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(In particular, DxP depends only on the values of p along a curve c with 
c(0) = X, and not on all the values of p in a neighborhood of the base point 
of X.) 

Dv(t)P(t) = 0 thus represents a linear system of first order ODEs for the 
coefficients a l {t), . . . ,a n (t) of p(t). Therefore, for given initial values p( 0) € 
E c ( o) , there exists a unique solution of 

Dv { tMt)= 0. (3.1.7) 

Definition 3.1.2 The solution p(t) of (3.1.7) is called the parallel transport 
of p(0) along the curve c. 


Thus, if Xq and X\ are points in M, the hbers of E above xq and xi, 
E Xo and E Xl , resp., can be identified by choosing a curve c from Xo to aq 
{xq = c(0),a;i = c(l)) and moving each p 0 £ E Xa along c to E Xl by parallel 
transport. This identification depends only on the choice of the curve c. One 
might now try to select geodesics w.r.t. a Riemannian metric as canonical 
curves, but those are in general not uniquely determined by their endpoints. 

From parallel transport on a Riemannian manifold, i.e. the identification 
of the fibers of a vector bundle along curves, one may obtain a notion of 
covariant derivative. For this purpose, given V £ T X M, let c be a curve in M 
with c(0) = ai,c(0) = V. For p £ r(E), we then put 


DyU := lim 
t-> o 


P c ,t(p(c(t ))) - p(c( 0)) 


where P c t : E c ^ — » E c ( 0j is the identification by parallel transport along 
c. In order to see that the two processes of covariant derivative and parallel 
transport are equivalent, we select a basis of parallel sections pi(t), . . . . p n (t) 
of E along c, 
i.e. 

D i(t)^j{t) = 0 for j = 1, , . . , n. (3.1.8) 

An arbitrary section p of E along c is then written as 

p{t) = a k (t)p k (t), 

and for X = c(0), we have 

D x p(t) = a k (t)p k (t) by (3.1.6), (3.1.8) (3.1.9) 


and consequently, 

(DxpMO)) = |im ak{t) ~ ak{0) p k ( 0) 
= lim P c ,t( v(t)) - o) 

t-> 0 t 
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It is important to remark that this does not depend on the choice of the curve 
c, as long as c(0) = X. 

We want to explain the name “connection”. We consider the tangent 
space at the point ^ to the total space E of a vector bundle, T^E. Inside 
T^E. there is a distinguished subspace, namely the tangent space to the fiber 
E x containing i/’ (x = tt(i/))). This space is called vertical space V$. However, 
there is no distinguished “horizontal space” H \p complementary to V^, i.e. 
satisfying T^E = © H^. If we have a covariant derivative D, however, 

we can parallely transport ^ for each X G T X M along a curve c(t) with 
c(0) = x, c(0) = X. Thus, for each X, we obtain a curve ip(t) in E. The 
subspace of 7]pE spanned by all tangent vectors to E at ip of the form 

then is the horizontal space In this manner, one obtains a rule how the 
fibers in neighbouring points are “connected” with each other. 

We return to (3.1.6), i.e. 

(3.1.10) 

v ' dx z 

= a 3 {t)vj{c{t)) + c l {t)a 3 {t)r^(c{t))^ k {c{t)). 

Here, 

a j {t) = c i (t)^ i {c{t)). (3.1.11) 

This part thus is completely independent of D. 

rfj now has indices j and k, running from 1 to n, and an index running 
from 1 to d. The index i describes the application of the tangent vector 
c l (t) q x , . We thus consider {rk.) i j k as an (n x n)-matrix valued 1-form on 
U : 

Ulj),.j. k G r( fl I(n,M) ® T*M W ) (3.1.12) 

(Here, g[(n, K) is the space of (n x n)-matrices with real coefficients.) In a 
more abstract manner, we now write on U 

D = d + A , (3.1.13) 

where d is exterior derivative and A G T(gl(n, M.)®T*M\u). Of course, A can 
also be considered as an (nx n)-matrix with values in sections of the cotangent 
bundle of M; A, applied to the tangent vector becomes (-Ty). 7 ,fc= i,...,n- 
By (3.1.10), the application of A to a J /Zj is given by ordinary matrix multi- 
plication. Once more: 

D(a 3 /ij) = d(a 3 )fij + a 3 Afj,j, (3.1.14) 

where A is a matrix with values in T* M. 
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We now want to study the transformation behavior of A. As in 1.5, let 
(U a ) aeA be a covering of M by open sets over which the bundle is trivial, 
with transition maps 

ififSoc : U a n Up -> G1 (n, R). 

D then defines a T*M-valued matrix A a on U a . Let the section p be repre- 
sented by p a on U a . Here, a Greek index is not a coordinate index, but refers 
to the chosen covering (U a ). Thus, 

M/3 = <M/3 « AG on U a n Up. (3.1.15) 

But then we must also have 

<M/3 a(d + A a )n a = (d + Ap)np on U a n Up; (3.1.16) 

on the left hand side we have first computed Dp in the trivialization defined 
by the U a and then transformed the result to the trivialization defined by 
Up, while on the right hand side, we have directly expressed Dp in the latter 
trivialization. 

We obtain 

A a = VpadVPct + Vpl A f}<Pl3oc- (3.1.17) 

This formula gives the desired transformation behavior. Thus, A a does not 
transform as a tensor (see the discussion following Definition 1.5.10), because 
of the term ip^chp p a . However, the difference of two connections transforms 
as a tensor. The space of all connections on a given vector bundle E thus is 
an affine space. The difference of two connections D\,D 2 is a gl(n, Revalued 
1-form, i.e. D 1 — D 2 & P (End E ® T*M), considering g[(n, R) as the space of 
linear endomorphisms of the fibers. 

We return to our fixed neighborhood U and thus drop the index a. 

We want to extend D from E to other bundles associated with E, in 
particular to E* and End(P) = E <g» E* . 

We now write 

Apj = Ajp k , (3.1.18) 

where each A * now is a 1-form, A j = r^-dx 1 . Let p\, . . . , p„ be the basis dual 
to Mi, • • • , Mn on the bundle E* dual to E, i.e. 

(p i ,p*) = S ij , (3.1.19) 

where (•,•): E ® E* — >■ R is the bilinear pairing between E and E* . 

Definition 3.1.3 Let D be a connection on E. The connection D* dual to 
D on the dual bundle E* is defined by the requirement 

d(p, 1 A) = (Dp, v*) + (p, MV) (3.1.20) 

for any p € r(E), v* G r(E*). 

(Dp G r(E®T*M), and (Dp, v*) pairs the P-factor of Dp with v* . Thus 
(Dp,v*), and similarly (p,D*v*), is a 1-form.) 
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As usual, we write D = d + A on U and compute 

0 = + (w, 

= A{+A*/ by (3.1.19), 


Recalling (3.1.5), we get 


= - 4 / 4 - 


(3.1.21) 


Definition 3.1.4 Let Ei,E 2 be vector bundles over M with connections 
Di,D 2 , resp. The induced connection D on E := Ei (g E 2 is defined by the 
requirement 

D(/ri g /z 2 ) = Di/j.1 g /U 2 + Mi ® D 2 fi 2 (3.1.22) 

for Hi G r(Ei),i = 1,2. 


In particular, we obtain an induced connection on End (E) = E (g) E*, 
again denoted by D. Let a = (g> p* be a section of End {E). We compute 

<g> p*) = dcrjiii (g) it* + a jAt/ik g /z* — a^A^m (g fi* k 

= da+[A,a]. (3.1.23) 

The induced connection on End ( E ) thus operates by taking the Lie bracket. 

We next want to extend the operation of a connection D from r(E) to 
r(E)<g>f2 p (M) (0 < p < d). Since, on 17 P (M), we have the exterior derivative 
d , we define in analogy with Definition 3.1.4 for /z G r(E),u G 17 P (M) 

D{n 0w) = Dp A w + p (g> dw. (3.1.24) 

(Here, we have employed a wedge product of forms with values in vec- 
tor bundles, as Dii is an element of r(E) (g ) 17 1 (M): If tr G r(E),u i G 
fi l ( y M),ui 2 € 17 P (M), then 

(cr g) Wi) Aw 2 := a g (wi A 0^2)5 

and the general case is defined by linear extension.) 

As an abbreviation, we write 

Q P {E) := r(E) <g> f2 p (M), f! p := fi p (M). 

Thus 

D : fl p (E) — > 17 P+1 (E), 0 <p < d. 

We want to compare this with the exterior derivative 


d : n p -> I7 p+1 . 
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Here, we have 


d o d = 0. 


Such a relation, however, in general does not hold anymore for D. 


Definition 3.1.5 The curvature of a connection D is the operator 
F := D o D : F2°(E) -► F2 2 (E). 

The connection is called flat, if its curvature satisfies F = 0. 


The exterior derivative d thus yields a flat connection on the trivial bundle 
M x R. 

We compute for fj, G r(E) 

F(fl) = (d + A) o (d + A)n 
= (d + Afldfi + Afl) 

= ( dA ) p — Adfi + Ad/r + A A Hp 


(the minus sign occurs, because A is a 1-form). 

Thus 

F = dA + A A A. 

If we write A = Ajdxfl (3.1.24) becomes 

F = (S + AlAj ) dF A dx ° 

1 f dAj dAi A j 

= 2(a?“&J + | - 4 '-' 4 -' 1 ) 


(3.1.25) 


(3.1.26) 


(note that each Aj is an (n x n)-matrix). 

We now want to compute DF. F is a map from FI °(E) to FI 2 (E), i.e. 


F G Fi 2 (E) ® (17 °(E)Y = l7 2 (End E). 


We thus consider F as a 2-form with values in End If. By (3.1.23) then 


DF = dF+ [A, F] 

= dA A A — A A dA + [A, dA + A A A\ by (3.1.23) 

= dA A A — A A dA + A A dA — dA A A + [A, A A A] 
= [A, A A A] 

= [Ajdx®, AjdxA A Afedx fc ] 

= AiAjAfldx 1 A da7 A — dad A dx k A dad) 

= 0 . 


This is the so-called second Bianchi identity. 
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Theorem 3.1.1 The curvature F of a connection D satisfies 

DF = 0. □ 


We now want to study the transformation behavior of F. We use the 
same covering (U a ) ae A as above, and on U ai we write again D = d + A a , 
A a = A a jdx l . F then has the corresponding representation 


Fa = 2 (^T - dxi A dx ° (3.1.27) 

by (3.1.26). Using the transformation formula (3.1.16) for A a , we see that in 
the transformation formula for F a , all derivatives of ipp a cancel, and we have 

E a = TflaFpT 0a • (3.1.28) 

Thus, in contrast to A, F transforms as a tensor. 

We now want to express F in terms of the Christoffel symbols. In order to 
make contact with the classical notation, we denote the curvature operator, 
considered as an element of h? 2 (End E) by R : 


F : n°(E) -> fZ 2 (E) 
and we define the components R\ jC by 


D ( d d \ , 

R \dx v dxi) M~ R UjPk 


(3.1.29) 


(k,£ G {1, • • • , n}, i,j G {1, — , rf}). By (3.1.26) 

#(v)M = F Tt 

= 2 U# “ a# + ^ A ^ (3 - 1 - 30) 


i.e. 


^ drf, 


TDK 

rip a a — 


pk jim y-iAi 


Q x i Q x j ' 1 im 1 j£ 1 jm 


(3.1.31) 


Theorem 3.1.2 The curvature tensor R of a connection D satisfies 

R{X , Y ) /j = DxF>yH — DyDxti — D[ X ,y]V (3.1.32) 

for all vector fields X, Y on M, and all /a G r{E). 


Proof. A direct computation is possible. However, one may also argue more 
abstractly as follows: First, (3.1.32) holds for X = -JF, Y = AF. Namely, in 
this case [X, Y] = 0, and (3.1.32) follows from (3.1.26). 
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We have seen already that R transforms as a tensor (the tensorial trans- 
formation behavior w.r.t. X, Y also follows from (3.1.26), for example), and 
thus the value of R(X,Y)p at the point x depends only on the values of X 
and Y at x. Now for X = C^tjY = rf g|j 

dn> ■ dC 

D x DyH - DyDxn =£' ~ 

OX 1 9x3 ijx J 9x' 

-Cil i (l) a D.m. -Dj^DjA/x 
V dx^ dxO dx3 9s* J 

and 

^[X,Y]^ D(ci dr)i e dj r a 

dx * dxO dxO 9s* 

hence 

DxDy/x — DyDxl-i — D\x Y]i 'J- = (d 9 D 9 — D a D e p , 

and this has the desired tensorial form. □ 

In order to develop the geometric intuition for the curvature tensor, we 
want to consider vector fields X,Y with [X, Y] = 0, e.g. coordinate vector 
fields • Then 

R(X, Y) = D X Dy — DyD X - 

When forming DxDyp, , we first move p by infinitesimal parallel transport 
in the direction Y and then in the direction X\ when forming DyDxP-, the 
order is reversed. R(X,Y)/x then expresses the difference in the results of 
these two operations, or, in other words, the dependence of parallel transport 
on the chosen path. 

Corollary 3.1.1 We have 

R(X, Y) = - R{Y , X). (3.1.33) 

Proof. From (3.1.32). □ 

Corollary 3.1.2 

R k Hj = -R k tji Vi,3,k,t. 

Proof. This reformulation of (3.1.33) also follows from (3.1.30). □ 

Connections on the tangent bundle TM are particularly important: 

Definition 3.1.6 Let V be a connection on the tangent bundle TM of a 
differentiable manifold M. A curve c : I — > M is called autoparallel or geodesic 
w.r.t. V if 
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VcC = 0 (3.1.34) 

i.e. if the tangent field of c is parallel along c. 

In local coordinates, c = c'^^and 

V,c=(c fc + ^.c^)^, (3.1.35) 

and the equation for geodesics has the same form as the one in 1.4. The dif- 
ference is that the Christoffel symbols now have been defined differently. We 
shall clarify the relation between these two definitions below in 3.3. Accord- 
ing to (3.1.35), (3.1.34) is a system of 2nd order ODE, and thus, as in 1.4, 
for each x £ M, X £ T X M , there exist a maximal interval I = lx C R. with 
0 £ lx and a geodesic c= cx 


c : I 


M 


with c(0) = a;,c(0) = X. 

C := {X £ TM : 1 £ I\} is a star-shaped neighborhood of the zero 
section of TAI, and as in 1.4, we define an exponential map by 

exp : C — •> M 
X^c x ( 1). 

If X £ C, 0 < t < 1, then exp (t-X) = exit). 

Definition 3.1.7 The torsion tensor of a connection V on TM is defined as 
T(X,Y) :=T V (X,Y) :=X7 x Y-V y X-[X,Y] {X,Y £ (3.1.36) 

V is called torsion free if 

T = 0. (3.1.37) 


Remark. It is not difficult to verify that T is indeed a tensor, i.e. that the 
value of T(X,Y)(x) only depends on the values of X and Y at the point x. 


In terms of our local coordinates, the components of the torsion tensor T 
are given by 


Tij = T 


_d_ _d_ 
dx i ' dxi 


= 

9a; 1 


— -V 9 - <r k r fc i 9 
dxi ,Cj i)x' 1 •" i).r h ' 


(3.1.38) 


We conclude 


Lemma 3.1.1 The connection V on TAI is torsion free if and only if 

I'!:, - /;') for alii, j,k. (3.1.39). 
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□ 

Definition 3.1.8 A connection V on TM is called flat if each point in M 
possesses a neighborhood U with local coordinates for which all the coordi- 
nate vector fields are parallel, that is, 

v i = °- < 3L40 > 

Theorem 3.1.3 A connection V on TM is flat if and only if its curvature 
and torsion vanish identically. 

Proof. When the connection is flat, all V _a_ A, = 0, and so, all Christoffel 

dx i 

symbols Th = 0, and therefore, also T and R vanish, as they can be expressed 
in terms of the Pf. 

For the converse direction, we need to find local coordinates for which 
0 = g| j for all i,j. Putting Hi := gfo-, we obtain the system 

+ r jiVk = 0 for ali i, j. (3.1.41) 

In vector notation, this becomes 

^u + /> = 0, (3.1.42) 

and by the theorem of Frobenius, this can be solved if and only if the inte- 
grability condition 

[ r ‘’ r >i + ^ r >-^j r ‘ = ° < 3 - L43 > 

holds for all i,j. With indices, this is 

dr\ 

+ r im r !e - = 0 for alH, j, (3.1.44) 

which by equation (3.1.31) means that the curvature tensor vanishes. We can 
thus solve (3.1.41) for the Hi. In order that these /i, are coordinate vector 
fields gfo, the necessary and sufficient condition (again, by the theorem of 
Frobenius) is 

d d 

dP lh = dx^ H foralU,i ’ (3 ' L45) 

which by (3. 1.41) in turn is equivalent to the condition /A - /A for all i,j, k , 
that is, by Lemma 3.1.1, the vanishing of the torsion T. This completes the 
proof. □ 
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Perspectives. Ehresmann was the first to arrive at the correct concept of a 
connection in a vector bundle. Equivalently, the concept may also be introduced in 
a principal bundle (see the discussion at the end of 1.7). The theory of connections 
is systematically explored in [160, 161]. 

The curvature tensor introduced here generalizes the Riemann curvature tensor 
derived from a Riemannian metric in §3.3 below. 

The Bianchi identity (Theorem 3.1.1) may be derived in a more conceptual way 
as the infinitesimal version of the equivariance of the curvature form F with respect 
to certain transformations in horizontal directions, see[214]. 

For a more detailed and elementary discussion of integrability conditions and 
the Frobenius theorem, we refer to [71]. 


3.2 Metric Connections. The Yang-Mills Functional 

Definition 3.2.1 Let E be a vector bundle on the differentiable manifold M 
with bundle metric (•, •). A connection D on E is called metric if 

d(p, v) = ( Du , v) + (yu, Dv) for all /z, v £ T(E). (3.2.1) 


A metric connection thus has to respect an additional structure, namely 
the metric. 

We want to interpret condition (3.2.1). Let X £ T x M : (3.2.1) then means 

X(p.,u) = v) + (n,D x v). (3.2.2) 

Let now c : I —* M be a smooth curve, and let /t(f) and v(t) be parallel along 
c, i.e. Dj/i = 0 = D x v. Then from (3.2.2) 

|(M*), «/(*))= 0. (3.2.3) 

This can be interpreted as follows: 

Lemma 3.2.1 The parallel transport induced by a metric connection on a 
vector bundle preserves the bundle metric in the sense that parallel transport 
constitutes an isometry of the corresponding fibers. 

Namely, (3.2.3) means that the scalar product is preserved under parallel 
transport. □ 

Lemma 3.2.2 Let D be a metric connection on the vector bundle E with 
bundle metric (•,■). Assume that w.r.t. a metric bundle chart, (cf. Definition 
1.5.12 and Theorem 1.5.3), we have the decomposition 

D = d + A. 


Then for any X £ TM , the matrix A(X) is skew symmetric, i.e. 
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A(X) £ o (n) (= Lie algebra of 0{n)) ( n = rank of E). 


Proof. As described in Theorem 1.5.3, a metric bundle chart (/, U) generates 
sections n i> • ■ • iPn on U that form an orthonormal basis of the fiber E x at 
each x £ U, i.e. 

{fj,i(x),Hj(x)) = Sij. 

Moreover, since the Hi are constant in the bundle chart, we have for the 
exterior derivative d defined by the chart 

dfii =0 (i = 1, . . . , n). 


Let now X £ T X M, x £ U. 
It follows that 


0 — X (fii, /ij) — (A(X)fii, fj,j) + (^tj, A(X)fij) 

= {A(X)i Hk, Vj) + (Pi, A(X)j fik) 

= A(X){ + A(X)). □ 


By 

fl p (AdE), 

we denote the space of those elements of 17 p (End£l) for which the endo- 
morphism of each fiber is skew symmetric. Thus, if D = d + A is a metric 
connection, we have 

A £ l? 1 (Ad E). 

We define 

D* : f2 p (Ad E) -► i7 p_1 (AdL;) 
as the operator dual to 

D : f2 p ~ 1 {Ad E) -► f2 p (AdE) 

w.r.t. (•, •); thus 

{D*u,n) = for all /z £ f2 p ~ 1 (Ad E), v £ f2 p (AdE). (3.2.4) 


This is in complete analogy with the definition of d* in 2.1. Indeed, for D = 
d + A (A £ l? 1 (Ad E)),A = Aidx 1 

(v, d/j, + Aidx 1 A n) = ( d*v , / j .) — (AiV, dx l A fj,), since A t is skew symmetric. 

(3.2.5) 


By Lemma 2.1.1, in this case 


** = (-1 )p( d ~p). 
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* : f] p (AdE) — ► [2 d ~P (Ad E) operates on the differential form part as de- 
scribed in 2.1 and leaves the Ad .E-part as it is: 

*(/x <g) uj) = fi ® *uj ior p € r(AdE),u; g f2 p , 

and by Lemma 2.1.4 

d* = (-1)Ap+i)+i . 

Moreover, A, and * commute, since A t operates on the Ad E-part and * on 
the form part. In particular, 

*A,;* = Aj. 


Thus, from (3.2.5) 

D* = (-1)Ap+i)+i * ( d + A)* = (_i)d( P +i)+i * . (3.2.6) 

(Note, however, that A operates on the form part by contraction and not by 
multiplication with dx l ). 

In Chapter 7, we shall need to compute expressions of the form 


where ip is a section of a vector bundle E with a metric connection D. We 
obtain 


A(p,p) = d*d(p,p{ 

= (—1) * d* d(ip, p) 

= 2(— 1) * d* (Dip, ip) 
= 2(— 1) * d* (Dip, ip) 


= 2(— 1) * ((D * Dp, p) + (*Dp, Dp)) 
= 2 ((D*Dp,p)-(Dp,Dp)) 


since D is metric 
(Dp is a 1-form with 
values in E, and * 
operates on the form part 
whereas (.,.) multiplies 
the vector parts, and so 
* and (.,.) commute) 
sinde D is metric 
by (3.2.6), and since 
** = 1 on 2-forms. 


Thus, we obtain the formula 

A(p, p) = 2((D*Dp, p) - (Dp, Dp)). (3.2.7) 

We now study the curvature of metric connection and observe first 


Corollary 3.2.1 Let D = d + A be a metric connection on E. Then the 
curvature F of D satisfies 
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F € C 2 (Ad E). 


Proof. We consider (3.1.25). Under the conditions of Lemma 3.2.2, 


dA 

dxi 


dA 

dx l 


+ [A ,Af\ 


is a skew symmetric matrix for each pair ( i,j ), because the Lie bracket of 
two skew symmetric matrices is skew symmetric again, since o(n) is a Lie 
algebra. □ 


Note that Fij = | ^ + [Ai AY) is always skew symmetric in i 

and j. This is also expressed by Corollary 3.1.1. By way of contrast, Corollary 
3.2.1 expresses the skew symmetry of the matrix 


R 


k 

Uj 


w.r.t. the indices k and t : 


Corollary 3.2.2 For a metric connection, 

R Uj = - R kij for all i,j £ {1, . . . , dj, k,£ £ {1, . . . , n} (3.2.8) 
(d = dim M, n = rank of E). □ 


For A,B £ o(n), we put 

A- B = ~tr(AB). (3.2.9) 

This is the negative of the Killing form of the Lie algebra o(n). (3.2.9) defines 
a (positive definite) scalar product on o(n). (3.2.9) then also defines a scalar 
product on Ad E. We now recall that we also have a pointwise scalar product 
for p-forms: For lu i,U 2 £ A P T*M we have 

(wi, w 2 ) = *(<^i A *w 2 )i (3.2.10) 

cf. Lemma 2.1.2. Thus, we also have a scalar product for (j\ 0 o>i, /i 2 0 w 2 £ 
Ad E x 0 A p Tf M, namely 

(pi 0 wi,/x 2 0 oj 2 ) ■= Mi • M2(wi,w 2 ). (3.2.11) 

Thus, by linear extension, we also obtain a scalar product on Ad E x <g>A p TfM. 
This in turn yields an L 2 -scalar product on f2 p (Ad E) : 

(gi0Wi,/j 2 ®w 2 ) := y"(Mi 0 Wi,M2 <S>w 2 ) * (1), (3.2.12) 

M 

assuming again that AI is compact and oriented. 
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Definition 3.2.2 Let M be a compact, oriented Riemannian manifold, E 
a vector bundle with a bundle metric over M, D a metric connection on E 
with curvature Fjj G fi 2 (Ad E). The Yang-Mills functional applied to D is 

YM(D) := (F D ,F D ) = J (F d ,F d ) * (1). 

M 

We now recall that the space of all connections on E is an affine space; the 
difference of two connections is an element of l? 1 (End I?) . Likewise, the space 
of all metric connections on E is an affine space; the difference of two metric 
connections is an element of f2 1 (AdE). If we want to determine the Euler- 
Lagrange equations for the Yang-Mills functional, we may thus use variations 
of the form 

D + tB with B G D 1 (AdE). 

For cr G F(E) = D°(E), 

FD+tB(cr) = (D + tB)(D + tB)a 

= D 2 a + tD(Ba) + tB A Da + t 2 (B A B)a (3.2.13) 
= ( F d + t(DB) + t 2 (B A B))a, 

since D(Ba) = ( DB)a — BA Da (compare the derivation of (3.1.25)). 
Consequently 

j AT 

—YM(D + tB) | t=0 = ^ J (F D+ t B ,F D+tB ) * (l)| t =o 

= 2 J(DB,F d )*(1). (3.2.14) 

Recalling the definition of D* (3.2.4) (3.2.14) becomes 
j f YM(D + tB ) , t=0 = 2 (B, D*F d ). 

Thus, I? is a critical point of the Yang-Mills functional if and only if 

D*F d = 0. (3.2.15) 


Definition 3.2.3 A metric connection D on the vector bundle E with a 
bundle metric over the oriented Riemannian manifold M is called a Yang- 
Mills connection if 

D*F d = 0. 

We write F D = F^dx 1 A dud . and we want to interpret (3.2.5) in local coor- 
dinates with gij (x) = Sij. In such coordinates, 
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and from (3.2.5) hence 

(3.2.15) thus means 
dF - 

~Q^- + [Ai,Fij\=0 iorj = 1, ... ,d. (3.2.16) 

We now discuss gauge transformations. 

Let A again be a vector bundle with a bundle metric. Aut ( E ) then is the 
bundle with fiber over x £ M the group of orthogonal self transformations of 
the fiber E x . 

Definition 3.2.4 A gauge transformation is a section of Aut (A). The group 
Q of gauge transformations is called the gauge group of the metric bundle E. 

The group structure here is given by fiberwise matrix multiplication, s £ Q 
operates on the space of metric connections D on E via 

s*(D) := T'oDos, 

i.e. 

s*(D)n = s-'Disn) (3.2.17) 

for /i £ r(E). For D = d + A, we obtain as in the proof of (3.1.16) 

s*(A) = s~ 1 ds + s -1 As. (3.2.18) 

Subsequently, this notion will also be applied in somewhat greater generality. 
Namely, if the structure group of E is not necessarily SO(n) , but any subgroup 
of G1(,R), we let Aut(A) the bundle with fiber given by G, and operating on 
E again by conjugation. The group of sections of Aut (A) will again be called 
the gauge group. 

Given Xq £ M, we may always find a neighborhood of U of xo and a 
section s of Aut (A) over U, i.e. a gauge transformation defined on U, such 
that 

s*(A)(x 0 ) = 0 

Namely, according to (3.2.18), we just have to solve 
s(x 0 ) = id, ds(x 0 ) = -A{x 0 ). 

This is possible since A £ 12 1 (AdA), and the fiber of AclA is the Lie algebra 
of the fiber of Aut (A), a section of which s has to be. Thus, 

Lemma 3.2.3 Let D be a connection on the vector bundle A over M. For any 
Xo £ M, there exists a gauge transformation s defined on some neighborhood 
of xq such that the gauge transformed connection s*(D) satisfies 
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s*(D ) = d at Xo. 

Of course, the gauge transformation can always he chosen to be compatible 
with any structure preserved by D, in particular a metric. 

Proof. □ 

In the same notation as in the derivation of (3.1.16), s as a section of 
Aut(-E) transforms as 

$0 — (3.2.19) 

The curvature F of D transforms as in (3.2.17): 

s*F = s~ 1 oFos. (3.2.20) 

An orthogonal self map of E is an isometry of (•,•), and hence 

(s*F, s*F) = (F, F). (3.2.21) 


We conclude: 

Theorem 3.2.1 The Yang-Mills functional is invariant under the operation 
of the gauge group Q . Hence also the set of critical points ofYM, i.e. the set of 
Yang-Mills connections, is invariant. Thus, if D is a Yang-Mills connection, 
so is s*D for s £ Q. □ 

Corollary 3.2.3 The space of Yang-Mills connections on a given metric 
vector bundle E of rank >2 is infinite dimensional, unless empty. □ 

For n > 2,0 (n) is nonabelian. Thus, by (3.2.18), in general not only 
s _1 As ^ A, but by (3.2.20) also 

s*F ± F. 

It is nevertheless instructive to consider the case n — 2. o(2) is a trivial Lie 
algebra in the sense that the Lie bracket vanishes identically. Ad E thus is 
the trivial bundle Mxl. Consequently for D = d + A 

F = dA. (3.2.22) 

Similarly, the Bianchi identity (Theorem 3.1.1) becomes 

dF = 0, (3.2.23) 

and the Yang-Mills equation (3.2.15) becomes 

d*F = 0. (3.2.24) 

(3.2.22) does not mean that the 2-form F is exact, because (3.2.22) depends 
on the local decomposition D = d + A which in general is not global. That 
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F, as the curvature of a connection, satisfies the Bianchi identity, does mean, 
however, that F is closed. F then is harmonic if and only if I? is a Yang-Mills 
connection, cf. Lemma 2.1.5. Thus, existence and uniqueness of the curvature 
of a Yang-Mills connection are consequences of Hodge theory as in 2.2. Thus, 
Yang-Mills theory is a generalization (nonlinear in general) of Hodge theory. 
We now write (for n = 2) s £ Q as s = e u . Then 

s*(A)=A + du by (3.2.18). (3.2.25) 

(3.2.24) becomes 

d*dA = 0. (3.2.26) 

If we require in addition to d*dA = 0 the gauge condition 


d*A = 0, 


(3.2.27) 


we obtain the equation 

AA= (d*d + dd*)A = 0. (3.2.28) 

Without the gauge fixing (3.2.27), if A is a solution of the Yang-Mills equa- 
tion, so is 

A + a with a £ Q 1 , da = 0, 

and conversely, this way, knowing one solution, one obtains every other one; 
namely, if A + a with a £ l? 1 is a solution, we get d*a = 0, hence as in 2.1 
da = 0. If H 4 (M, K) = 0, for each such a , there exists a function u with 
a = du. With s = e“, we put 

s*(A) = A + a, 

and thus, in this case Q operates transitively on the space of Yang-Mills 
connections. 

We now consider the case d = 4 which is of special interest for the Yang- 
Mills equations. As always, M is compact and oriented and carries a Rieman- 
nian metric. * then maps A 2 T*M into itself: 

* : A 2 T*M -> A 2 T*M [x £ M). 

Since by Lemma 2.1.1, ** = 1, we obtain a decomposition 

a 2 t*m = a + @a- 

into the eigenspaces of * corresponding to the eigenvalues ±1. A 2 T*M is 
of dimension 6, and A + and A~ are both of dimension 3. Choosing normal 
coordinates with center x, A + is generated by 

dx 1 A dx 2 + dx 3 A dx A 
dx 1 A dx 4 + dx 2 A dx 3 
dx 1 A dx 3 — dx 2 A dx 4 
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and A by 

dx 1 A dx 3 + dx 2 A dx 4 
dx 1 A dx 2 — dx 3 A dx 4 
dx 1 A dx 4 — dx 2 A dx 3 . 

The elements of A + are called selfdual, those of A~ antiselfdual. 


Definition 3.2.5 A connection D on a vector bundle over an oriented four di- 
mensional Riemannian manifold is called (anti) self dual or an (anti)instanton 
if its curvature Fjj is an (anti)selfdual 2-form. 


Theorem 3.2.2 Each (anti) self dual metric connection is a solution of the 
Yang-Mills equations. 


Proof. The Yang-Mills equation is 

D*F = 0. 

By (3.2.6), this is equivalent to 

D*F = 0. (3.2.29) 

Let now F be (anti)selfdual. Then 

F = ± * F. (3.2.30) 


(3.2.29) then becomes 


D **F = 0, 


hence by ** = 1, 

DF= 0. 


This, however, is precisely the Bianchi identity, which is satisfied by Theorem 
3.1.1. ' □ 


In order to find a global interpretation of Theorem 3.2.2 in terms of the 
Yang-Mills functional, it is most instructive to consider the case of U(m) 
or SU(to) connections instead of SO(n) connections. The preceding theory 
carries over with little changes from SO(?r) to an arbitrary compact subgroup 
of the general linear group, in particular U(m) or SU(rn). We shall also need 
the concept of Chern classes. For that purpose, let E now be a complex vector 
bundle of rank m over the compact manifold M , D a connection in E with 
curvature F = D 2 : f2° — > Q 2 (E). We also recall the transformation rule 
(3.2.28): 

F a = PpaFpipap (3.2.31) 

which allows to consider F as an element of Ad E\ at the moment, the struc- 
ture group is Gl(?n,C) (as E is an arbitrary complex vector bundle), and 



3.2 Metric Connections. The Yang-Mills Functional 125 


so Adi? = EndE = Homc(E,E). We let M m denote the space of complex 
in x m-matrices, and we call a polynomial function, homogenous of degree k 
in its entries, 

P ; M m — > C, 

invariant if for all B £ M m , ip £ Gl(m,C) 

P(B) = P{p~ l Bp). 

Examples are the elementary symmetric polynomials P J {B ) of the eigenval- 
ues of B. Those satisfy 

m 

det (B + fid) = P m ~ k {B)t k (3.2.32) 

k—0 

Similary, a /c-linear form 

P : M m x ... x M m -> C 

is called invariant if for B \, . . . , Bj. £ M m , p £ Gl(m, C) 

P{B 1 , ...,B k ) = P(ip~ 1 B 1 <p, . . . , ip~ 1 B k p). 

The infinitesimal version of this property is that for all Bi, . . . ,B k £ M m , 
A £ g[(m, C) 

k 

Y l P(B 1 ,...,[A,B i ],...,B k )= 0 (3.2.33) 

i=l 

Restricting an invariant fc-form to the diagonal defines an invariant polyno- 
mial 

P(B) = P(B,...,B) 

Conversely, given an invariant polynomial, we may obtain an invariant fc-form 
by polarization: 

k ^ 

P( Bl ,...,B k ):=t^Y, E (-1 ) j P(B il +... + B i .). 

J=1 ii<...<ij 

Given an invariant polynomial P of degree k, we may use the transformation 
rule (3.2.31) for the curvature F of a connection D to define 

P(F) := P(F a ), 

using any local trivialization. P(F) then is a globally defined differential form 
of degree 2k. In particular, P(F) remains invariant under gauge transforma- 
tions, as those transform F into s -1 o F o s, cf. (3.2.20) 

Lemma 3.2.4 For an invariant polynomial of degree k, we have dP(F) = 0. 
Consequently, P(F ) defines a cohomology class [^(E)] £ H 2k (M), and this 
class does not depend on the chosen connection. 
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Proof. Let P be an invariant fc-form with P(B , . . . , B) = P(B) as above. As 
explained in §3.1, we may extend D as 

D : i? p (End£) -► 12 p+ 1 (End£). 

Since P is linear, we have 

dP{B u ...,B k ) = ^(-l) Pl+ -’ Pi -iP( J B 1 , . . . , dB t , ..., B k ) 

i 

By assumption 

P(F) = P(F, 

is invariant under gauge transformations. For any Xq £ M, Lemma 3.2.3 
means that after applying a local gauge transformation, we may assume that 
at xo, we have 

d = D. 


Thus, at Xq, 

dP(F) = Y,P(F,.. u DF,...F) 

i T 

i th entry 

As Xq was arbitrary, this holds for all M. 

(Alternatively, this may also be derived from (3.2.33), without using Lemma 
3.2.3). 

The Bianchi identy DF = 0 thus implies 


dP(F) = 0. 


If Dq, D\ are connections on E, then 77 := D\ 
locally 


and we put 


Dq — d A, 


Do £ I2 1 (End E). We write 


D^ ; — Dq T tr/ — d T A T tTj. 
The curvatures thus are given by 


F t = d(A + tr/) + (A + trj) A (A + tij), 


and 


We obtain 



D t V- 


|r W ) = (=r(|F„ 

= kP {D t rj, F t , ... , F t ) 

= d(kP(r), F t ,. . . , F t )) as D t F t = 0 by the Bianchi identiy. 
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Therefore 

1 

P(F 1 )-P(F 0 ) = J | P(F t )dt 

o 

is cohomologous to zero. □ 

Definition 3.2.6 The Chern classes of E are defined as 

°j{E)= €HV(M) 

where Pi is the j th elementary symmetric polynomial, and F is the curvature 
of an arbitrary connection on E. 

Recalling (3.2.32), we have 

m 

= y dCm - k (E)t k , 

k—0 

or with the eigenvalues A a of ^ F (the are 2-forms) and r := t _1 , 

ra / . \ m 

Y cj(E)t J = det ( —tF + Id ) = JJ(1 + Aq,t) (3.2.34) 

j=0 ' ^ ' a— 1 

In particular, we have 

r, (/•.'! - l~\ yF (3.2.35) 

2,77 

Tin — 1 1 

°2(E) - (E) A Cl (E) = ^tr (F 0 A F 0 ), (3.2.36) 

where 

F 0 := F — • Id# is the trace free part of F. (3.2.37) 

TO 

We now return to the situation of a U (to) vector bundle E over a four dimen- 
sional oriented Riemannian manifold M. We let D be a unitary connection 
on E with curvature F = D 2 as usual. We decompose F 0 into its selfdual 
and antiselfdual components 


det I — F + ild 

\ 27T 


F 0 = F+ + F 0 -. 


(3.2.38) 
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Then 


tr(F 0 A F 0 ) = tr(F 0 + A Fq) + tr(F 0 A F 0 ) 

Since the A product of a selfdual 2- 
form with an antiselfdual one always 
vanish (This can be seen from the 
above generators of A + and A~). 

= trOPo + A *F+ ) - tr(F 0 " A *F 0 " ) 
since *F± = ±F± 

= -|^o + | 2 + \ F o\ 2 cf. (3.2.9) (3.2.39) 

Recalling (3.2.36), we conclude that integrating over M yields 

(C2 (E) - ^c l( F) 2 )[M] = ~^J - |F 0 -| 2 ) * (1) (3.2.40) 

The Yang-Mills functional decomposes as 

YM{D) = J (^|trF| 2 + |F 0 | 2 ) *(1) 

M 

= j (^|trF| 2 + |F 0 +| 2 + |F 0 "| 2 ) * (1) (3.2.41) 

M 

Since trF represents the cohomology class — 27rici(F), the cohomology class 
of trF is fixed, and 

J |trF| 2 * (1) 

M 

becomes minimal if trF is a harmonic 2-form in this class, see §2.1. J |trF| 2 
and f |Fo| 2 may be minimized independently, and because of the constraint 
(3.2.40), /|F 0 | 2 becomes minimal if, depending on the sign of (c 2 (F) — 
^ Cl (F) 2 )[M], 

F+ = 0 or Fq = 0 (3.2.42) 

i.e. if Fq is antiselfdual or selfdual. 

If D is a SU(?n) connection, then the fiber of AdF is su(m) which is 
tracefree, and thus F € l7 2 (AdF) satisfies 

trF = 0 (3.2.43) 


Hence, by (3.2.35) 


ci{E) = 0, 
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and by (3.2.36), (3.2.40) 

c 2 {E)[M\ = J (|F+| 2 - |F"| 2 ) * (1) 

M 

where F* are the (anti)selfdual parts of F. 

Also, 

YM(D) = J (|F+| 2 + |F-| 2 ) *(1) 

m 

then is minimized if F is (anti)selfdual, again depending on the sign of 
c 2 (E)[M\. In conclusion we obtain 

Theorem 3.2.3 Let E be an SU(m) vector bundle over the compact oriented 
four dimensional manifold M. Then an SU(m) connection D on E yields an 
absolute minimum for YM if F is antiselfdual or self dual (depending on the 
sign c 2 (E)[M]), i.e if it satisfies the first order equation F — ± * F . 

Remark. Here, we do not address the question when the lower bound for 
the Yang-Mills functional just derived is achieved, i.e. when there exist (anti) 
selfdual connections. 

The Yang-Mills functional exhibits special features in dimension 4, as we 
have seen. There is also a functional that is well adapted to 3 dimensional 
manifolds, namely the Chern-Simons functional that we shall now briefly 
discuss. 

Let M be a compact 3 dimensional differentiable manifold, and let E be a 
vector bundle over M with structure group a compact subgroup G of Sl(n, R), 
with Lie algebra 0 as usual. We consider G-connections D, i.e. connections 
that can locally be written as 

D = d + A, with A £ L? 1 (g) . 

(As before, we identify g with the fibers of AdF, the endomorphisms of the 
fibers of E that are given by elements of g. The discussion here is a little 
more general than the one we presented in the 4 dimensional case, but the 
latter can easily be extended to the present level of generality as well.) 

We also suppose that E is a trivial G-bundle, i.e. as a vector bundle, E 
is isomorphic to M x R”, and the connection on E given by the exterior 
derivative d preserves the G-structure (e.g. if G = SO(n), and (-,■) is the 
corresponding metric on the fibers, then for any two sections tri, a 2 of E (that 
are considered as functions <ti,<T 2 : M — > R" under the above isomorphism), 
we have 


d(<j 1 ,a 2 ) = (d<Ji,a 2 ) + (a 1 ,dtj 2 ))- 
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In this case, for any other G-connection 

D = d + A 

on E, A is a globally defined 1-form with values in g. 


Definition 3.2.7 The Chern-Simons functional of A is defined as 


CS(A) = J tr ^AAcL4+ A A A 

M 


(3.2.44) 


(Here, tr of course is the trace in g, or in more general terms, the negative of 
the Killing form of g. In fact, one may take any Ad invariant scalar product 
on g here.) 


Remark. Without the assumption that A is a trivial G-bundle, we need to 
choose a base connection D$ = d + Aq. For D = d + A, A — Aq then is a 
globally defined 1-form with values in g, and we may thus insert A — A 0 in 
place of A in the definition of CS. 


An important observation is that for the definition of CS, we do not 
need to specify a Riemannian metric on M as the integrand is a 3-form on 
a 3-dimensional manifold. Thus, any invariants constructed from the Chern- 
Simons functional will automatically be topological invariants of the differ- 
entiable manifold M. 

In order to compute the Euler-Lagrange equations for CS, we consider 
variations A+tB, B £ I2 1 (g), as in the derivation of the Yang-Mills equations. 
We have 


f t CSU + ,B) k = 0 


-- J tr (B A dA + A A dB + 2B A A A A) 

(cf. (3.2.13) and use, with A = Aidx 1 , 
B = B i dx l tr(A A B A A) = tr (A k dx k A 
Bidx 1 A Ajdxi ) = tr {Bidx 1 A Ajdx J A 
Akdx k ) = tr(RAAAA) and similary for 
tr(A AiAB), as the trace is invariant 
under cyclic permutations) 


= 2 


tr(RA (dA + AaA)) 


(use / tr ( Aidx 1 A §§^dx^ A dx k ) = f tr 
( B k dx k A | ^fdx J A dx 1 )) 



tr (B A F a ), 


(3.2.45) 
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where Fa = dA + A A A is the curvature of the connection D = d + A. If this 
expression vanishes for all variations B £ f2 1 (o), then Fa = 0. Consequently, 
the Euler-Lagrange equations for CS are 

F a = 0 (3.2.46) 

i.e. A is a flat G-connection on E. 

Like the Yang-Mills equation, the equation (3.2.46) obviously remains 
invariant under gauge transformations. The equation (3.2.46) also arises as a 
reduction of the (anti)selfduality equations to 3 dimensions. Namely, suppose 
that M is a 3-dimensional oriented Riemannian manifold, and that we have 
a selfdual connection D = d + A on the 4-dimensional manifold 

N = M x I 


with the product metric, and that D = d + A can be written locally as 
d T A\dx^ T A 2 dx~ T A 3 dx 3 , 

where ar, x 2 , x 3 are coordinates on M and where Ai, A 2 , A 3 are functions 
of the x 1 , x 2 , x 3 only, and independent of the R-direction. Thus, we assume 
that D is trivial in the direction of the factor R. We denote the coordinate 
in that direction by x 4 . We write, in our coordinates, the curvature of D as 

F = Fi~dx 1 A dx J = ( — — — —4 + [Aj, A.l ) dx l A dxF 

\ OX 1 OXl J J 

Our assumption implies that 

Fn = 0 = Fij for all i, j. (3.2.47) 

On the other hand, if x 1 , x 2 , x 3 now are normal coordinates at the point of 
M under consideration, the selfduality equations become 


F12 — F 3 4 , F13 — T24 5 F\a — F23. 


(3.2.47) and (3.2.48) imply 


F = 0 


i.e. D = d + A is flat. 


(3.2.48) 


Perspectives. In the work of Donaldson, detailed accounts of which can be found 
in [79], [61], instantons were introduced as important tools for the study of the dif- 
ferential topology of four-dimensional manifolds. Let M be a compact differentiable 
four-manifold. As explained in 2.2, one has a natural pairing 

r : H 2 (M) x H 2 (M) — ► R 

(a, d) 1— > J a A /3. 

M 


r is called intersection form of M. 
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Donaldson showed that if M is simply connected (tti(M) = {1}) and if F is 
definite, then for a suitable basis of Ft 2 (Ad), F is represented by ± identity matrix. 
Since by the work of M. Freedman, there exist simply connected compact four- 
dinrensional manifolds with definite intersection form not equivalent to ± identity 
matrix, it follows that such manifolds cannot carry a differentiable structure, or 
in other words that there exist restrictions on the topology of compact, simply 
connected differentiable four-dimensional manifolds that are not present for non- 
differentiable ones. The crucial ingredient in the proof of Donaldson’s theorem is 
the moduli space 9 71 of instantons on a vector bundle over M with structure group 
SU (2) and with so-called topological charge 

^/tr(FAF )- 1 

M 

for the curvature F of a 5t/(2)-connection. As explained, the topological charge 
is a topological invariant of the bundle and does not depend on the choice of SU 
(2)-connection (it is the negative of the second Chern class of the bundle). In order 
to construct the moduli space of instantons, one identifies instantons that are gauge 
equivalent, i.e. differ only by a gauge transformation (see Theorem 3.2.1). Donaldson 
then showed that under the stated assumptions, 9ft is an oriented five-dimensional 
manifold with point singularities, at least for generic Riemannian metrics on M. 
Neighborhoods of the singular points are cones over complex projective space CP 2 
(see 5.1 below), and M itself is the boundary of 9ft. Deleting neighborhoods of 
the singular points, one obtains a smooth oriented five-dimensional manifold with 
boundary consisting of M and some copies of CP 2 . Therefore, in the terminology 
of algebraic topology, M is cobordant to a union of CP 2 ’s, and one knows that M 
then has the same intersection form as this union of CP 2 ’s. As will be demonstrated 
in 5.1, ff 2 (CP 2 ,K) = K, and the intersection form of CP 2 is 1. These facts then 
imply Donaldson’s theorem. The main work in the proof goes into deriving the 
stated properties of the moduli space 9 ft. In particular, one uses a theorem of 
Taubes on the existence of self-dual connections over four-manifolds with definite 
intersection form. 

Donaldson then went on to use the topology and geometry of these moduli 
spaces to define new invariants for differentiable four-manifolds, the so-called Don- 
aldson polynomials. These invariants greatly enhanced the understanding of the 
topology of differentiable four-manifolds. Subsequently, however, there has been 
found a simpler approach to this theory that is based on coupled equations for a 
section of a spinor bundle and a connection on an auxiliary bundle with an abelian 
gauge group, namely U(l). This will be explained in Chapter 7. 


3.3 The Levi-Civita Connection 

Let M be a Riemannian manifold with metric (•, •). 

Theorem 3.3.1 On each Riemannian manifold M, there is precisely one 
metric and torsion free connection V (on TM). R is determined by the for- 
mula 

(VxY, Z) = \{X(Y, Z) - Z(X, Y) + Y(Z , X) 

- (X, [Y, Z\) + (Z, (X, Y]) + (Y, (Z, X])} 


(3.3.1) 
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Definition 3.3.1 The connection V determined by (3.3.1) is called the Levi- 
Civita connection of M. 

In the sequel, V will always denote the Levi-Civita connection. 

Proof of Theorem 3. 3. 1 We shall first prove that each metric and torsion 
free connection V on TM has to satisfy (3.3.1). This will imply uniqueness. 
Since V should be metric, it has to satisfy: 

X(Y, Z) = (V*Y, Z) + (Y, X x Z) 

Y(Z, X) = (X Y Z, X) + (Z, X Y X) 

Z{X,Y) = (V z X,Y) + {X,V z Y) . 

Since V should also be torsion free, this implies 

X{Y, Z) - Z(X, Y) + Y{Z, X) 

= 2 (VjcY, Z) - ([X, Y], Z) + (Y, [X, Z]) + (X, [Y, Z ]) , 

i.e. (3.3.1). 

For the existence proof, for fixed X, Y, we consider the 1-form to assigning 
the right hand side of (3.3.1) to each Z. u>(Z) is tensorial in Z, because for 
/ G C°°(M) 


»(fZ) =MZ) + ±({Xf) (Y, Z) + (Y/) {Z, X) (3.3.2) 

- (■ Xf)(Y,Z ) - ( Yf){X,Z )) = MZ), 

and the additivity in Z is obvious. 

Therefore, there exists precisely one vector field A with 

u(Z) = (A,Z>, 

since (•,•) is nondegenerate. We thus put V X Y := A. It remains to show 
that this defines a metric and torsion free connection. Let us first verify that 
V defines a connection: Additivity w.r.t. X and Y is clear, the tensorial 
behavior w.r.t. X follows as in (3.3.2), and the derivation property V x fY = 
fV x Y + X(f) is verified in the same manner. That V is metric follows 
from (3.3.1) by adding (VxY Z) and (V X Z,Y). Likewise (3.3.1) implies 
(Va'Y, Z) — (VyJ, Z) = ([A, Y],Z), i.e. that V is torsion free. □ 

As in 1.4, let the metric in a local chart be given by {gij)i,j= i,...,d- The 
Christoffel symbols of the Levi-Civita connection V then are 

V d - r k d 

sTf dxi dx k ’ 


i,j = l,...,d. 


(3.3.3) 
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From (3.1.21), we then get 

S7j^dx j = ~r 3 ik dx k . (3.3.4) 

dx z 

Corollary 3.3.1 For the Levi-Civita connection, we have 
f \j = 2 5 ( 9i(,j + 9ji,% — gtj,e )• 

Thus, the Christoffel symbols coincide with those defined in l.f. Likewise, the 
two concepts of geodesics (from l.f and 3.1) coincide. In particular, 

r* = F % for all i,j, k . 

Proof. 

pk k( pm I ^ ^ \ kt lx-j ® ^ \ 

^ dx m ’ dx i Ftx dxl ’ dx e 

1 kt f d 9 d 1 , , 

= + w 9 -'/ by (331) - 

since the Lie brackets of coordinate vector fields vanish. □ 

We now want to exhibit some formulae for the curvature tensor R of the 
Levi-Civita connection V. R is given by 


R{X, Y)Z = V X V Y Z - VyV. v 2 - V [x ,y]Z 


(cf. (3.1.32)). In local coordinates, as in (3.1.29), 

td I A JM JL = ofc 9 

dx i ’ dxl ) dx e h:l dx k ' 


We put 

i.e. 



(3.3.5) 


(3.3.6) 


3 We point out that the indices k and l appear in different orders at the two 
sides of (3.3.6). This somewhat unusual convention has been adopted in order 
to achieve as much conformity as possible with the - often conflicting - sign 
conventions that occur in Riemannian geometry. Differing sign conventions often 
lead to considerable confusion, and we hope that the convention adopted here 
does not add too much to that problem. 
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Lemma 3.3.1 For vector fields X.Y, Z, IF we have 

R{X, Y)Z = - R(Y,X)Z , i.e. R Uij = -R Uj i (3.3.7) 

R(X,Y)Z+R(Y,Z)X+R(Z,X)Y = 0, i.e. RkUj + Rkijt+RkjU = 0 (3.3.8) 

(R(X,Y)Z,W) = -(R(X,Y)W,Z), i.e. R kUj = -R mj (3.3.9) 

{R(X,Y)Z,W) = (. R(Z,W)X,Y ), i.e. R mj = R ijke (3.3.10) 


Proof. It suffices to verify all claims for coordinate vector fields ^ . We may 

thus assume that all Lie brackets of X,Y,Z and IF vanish. (3.3.7) then is 

Corollary 3.1.1. For (3.3.8), we observe 

R(X, Y)Z + R{Y , Z)X + R{Z, X)Y 

= XxXyZ — XyXxZ + VyVzX — VyV-yA + V^V.yF — V yV zY = 0, 

since VyZ = VzY etc. because V is torsion free. 

For (3.3.9) it suffices to show (R(X,Y)Z, Z) = 0 for all X, Y, Z, i.e. 

Rkkij = 0. This follows from Corollary 3.2.2. (3.3.10) is proved as follows: 

From (3.3.7), (3.3.8) 

(R(X, Y)Z , W) = -(R(Y, X)Z, W) 

= {. R(X,Z)Y,W ) + (R(Z, Y)X, IF), (3.3.11) 

and from (3.3.8), (3.3.9) 

(R(X, Y)Z , W) = -<i2(A, y) w, z> 

= (i?(F, IF)A, Z) + { R{W. , A)F, Z). (3.3.12) 

From (3.3.11) and (3.3.12) 

2(i?(A, y)Z, IF) = (R(X, Z)Y. , IF) + (i?(Z, F) A, IF) 

+ (1?(F, IF) A, F) + (1?(IF,A)F,F). (3.3.13) 

Analogously, 

2(i?(Z, W)X, Y) = (. R(Z , A) IF, y) + (i?(A, IF)Z, F) 

+ (i?(IF, F)Z, A) + (i?(y, F)IF, A) 

= 2(R(X, Y)Z, W ) , 

□ 


by applying (3.3.7) and (3.3.9) to all terms. 
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Remark. (3.3.7) holds for any connection, (3.3.8) for a torsion free one, and 
(3.3.9) for a metric one. 

(3.3.8) is called the first Bianchi identity. 

Lemma 3.3.2 (Second Bianchi identity): 

d d d 

-Q^RkUj + -Q^Rchij + -^jRhkij = 0. (3.3.14) 

Proof. This is a special case of Theorem 3.1.1. We want to exhibit a dif- 
ferent method of proof, however. Since all expressions are tensors, in or- 
der to prove (3.3.14) at a point Xo £ M, we may choose arbitrary coor- 
dinates around x$. We thus choose normal coordinates with center xo, i.e. 
9ij{ x 0 ) = Sij,g ijt k(xo) = 0 = r£j(x 0 ) for all i,j,k. 

From (3.1.30), we obtain at xq 

1 

Rktij — 2^\9jk : H 4” ijtk.ij 9j£,ki fl'ik.fj [Jik/i/j T [Jijt.kj ) 

2 (djkJA 4” 9i£,kj 9j£,ki 9ik,Zj)t (3.3.15) 

hence also 

RkUj,h = ^ {9jk,Uh 4” 9i£,kjh 9j£,kih 9ik,£jh)t 

since all other terms contain certain first derivatives of g.ij , hence vanish at 
Xq. Thus 

RkUj.h 4” Rphij .k 4“ R'hkijJ. — 2 ( fj;l k.Hh 4“ 9i£,kjh 9j£,kih [Jik.tjh 

4“ 9j£,hik 4” 9i h , f.j k [ijhj/tk 9i£,hjk 
4 ” 9jh,kii 4 ” [itk.h / f: [Jjk.hif: [J i h , kj ) 

= 0 . □ 

Formula (3.3.15) is often useful. 

Definition 3.3.2 The sectional curvature of the plane spanned by the (lin- 
early independent) tangent vectors X = £ T X M of the 

Riemannian manifold M is 

K(X A Y) := (R(X,Y)Y,X) 1 

9ikgje{f' l f k v j 'n e - f i f J v k v e ) 

= RiMWtW 

{9ik9jt - 9ij9kt )£V£*V 
(|X Ay| 2 = (X,X)(Y,Y)-(X,Y) 2 ). 


(3.3.16) 
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Definition 3.3.3 The Ricci curvature in the direction X = G T X M is 

O r\ 

Bic(X,X) = g ji (R(X, —)— v X). (3.3.17) 

The Ricci tensor is 

R,k = 9 je RijU ■ (3.3.18) 

From (3.3.10) and (3.3.18) we get the symmetry 

Rik = Rki ■ (3.3.19) 

Finally, the scalar curvature is 

R = g ik R ik - 

Thus, the Ricci curvature is the average of the sectional curvatures of 
all planes in T X M containing X, and the scalar curvature is the average of 
the Ricci curvatures of all unit vectors, i.e. of the sectional curvatures of all 
planes in T X M. 

Lemma 3.3.3 With K(X,Y) := K(X A Y)\X A Y\ 2 {= (R(X, Y)Y, X)), we 
have 

(R(X, Y)Z , W) = K(X + W,Y + Z)- K(X + W,Y)~ K(X + W, Z) 

- K(X, Y + Z ) - K{W, Y + Z) + K(X, Z) + K(W, Y) 

- K(Y + W, X + Z) + K(Y + W,X)+ K(Y + W, Z) 

+ K(Y, X + Z) + K(W, X + Z)- K(Y, Z) - K(W, X). 

Thus, the sectional curvature determines the whole curvature tensor. 

Proof. Direct computation from Lemma 3.3.1. □ 

For d = dimM = 2, the curvature tensor is simply given by 

Rijkt — R ([iikOjf. gijgke), (3.3.20) 

since T X M contains only one plane, namely T X M itself. The function K = 
K(x ) is called the Gauss curvature. 

Definition 3.3.4 The Riemannian manifold M is called a space of constant 
sectional curvature, or a space form if K{X A Y") = K = const, for all linearly 
independent X, Y G T X M and all x € M. A space form is called spherical , 
flat , or hyperbolic , depending on whether K > 0,= 0, < 0. 

M is called an Einstein manifold if 

Rik = cg ik , c= const. 

(note that c does not depend on the choice of local coordinates). 



138 3. Parallel Transport, Connections, and Covariant Derivatives 


From Lemma 3.3.3 and Theorem 3.1.3, we see that the Riemannian man- 
ifolds of vanishing sectional curvature, the flat ones, are those that are locally 
isometric to Euclidean space, that is, possess local coordinates for which the 
coordinate vector fields -gR are parallel and by a linear transformation can 
then be chosen to satisfy 

_ ,_d <9_, r 

9 ii ~ { dx^ dxi } - ij - 

Theorem 3.3.2 (Schur) Let d = dim M > 3. If the sectional curvature of 
M is constant at each point, i.e. 

K(X A Y) = f(x) for X,Y G T X M , 

then f(x) = const., and M is a space form. 

Likewise, if the Ricci curvature is constant at each point, i. e. 

R%k — c{a^)pih-, 

then c( x) = const., and M is Einstein. 

Proof. Let K be constant at every point, i.e. I\(X AY) = f(x). From Lemma 
3.3.3, we obtain with f h = *-(/) 

Rijki — f (■£') {[liAJj k 9ik9ji) ■ 

By Lemma 3.3.2, with normal coordinates at x, we obtain 

d = Rijki, h T Rjhki,i T Rhiki,j = fh(&i£&jk dik^ji) 

A ffldjtdhk S jk 5hi) f jj (dhfdik dhkda). 

Since we assume dim M > 3, for each h, we can find h, i,j,k,£ with i = 
£,j = k,hfl=i,hfl^j,ij£ j. It follows that 0 = fh- Since this holds for all 
x G M and all h, we recall that M is connected by our general convention 
and conclude / = const. 

The second claim follows in the same manner. □ 

Schur’s theorem says that the isotropy of a Riemannian manifold, i.e. the 
property that at each point all directions are geometrically indistinguishable, 
implies the homogeneity, i.e. that all points are geometrically indistinguish- 
able. In particular, a pointwise property implies a global one. 

Example. We shall show that S n has constant sectional curvature, when 
equipped with the metric of 1.4, induced by the ambient Euclidean met- 
ric of K n+1 . The reason is simply that the group of orientation preserving 
isometries of S n , SO(n+ 1), operates transitively on the set of planes in TS n , 
i.e. can map any plane in TS n into any other one. This is geometrically ob- 
vious and also easily derived formally: First of all, we have already seen that 
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SO (n + 1) operates transitively on S n . It thus suffices to show that for any 
point p, e.g. p = (1, 0, . . . 0), SO(n + 1) maps any plane in T p S n onto any 
other one. The isotropy group of p = (1, 0 ... 0) is 

(o a) with A £ SO (n) 

(here, the zeroes are (l,n) and (n, 1) matrices). 

W.r.t. the Euclidean metric, T p S n is orthogonal to p, and SO(n+ 1) thus 
operates by X n AX on T p S n , and this operation is transitive on the 2- 
dimensional planes in T p S n . Since curvature is preserved by isometries it 
indeed follows that S n has constant sectional curvature. 


We want to consider the operation of the covariant derivative V of Levi- 
Civita on tensor fields once more. For a 1-form uj and vector fields X, Y, as 
in 3.1 

*(w(Y)) = (V x w)(y) + w(V A -Y). (3.3.21) 

Next, as in 3.1, for arbitrary tensors S,T 


V x (S®T) = V.y5®T + 5®V. y T. (3.3.22) 


If e.g. S' is a p-times covariant tensor, and Yi, . . . , Y p are vector fields, 

(Va‘S)(Fi, ...,Y p ) = X(S(Y 1 , . . . , Y p )) (3.3.23) 

- 2 S(Yi, . . . , Y-i, V a -Y, Y+i, . . . , Y p ). 

i = 1 

If in particular S = gijdx 1 (g> dx J =: g is the metric tensor, we get 

Va g = 0 for all vectorfields X . (3.3.24) 

This, of course, simply expresses the fact that V is a metric connection. 

We also want to compare V with the Lie derivative of 1.6. From Theorem 
1.6.4 (notations as there), we obtain 

(L x S)(Y u ...,Y p ) = X(S(Y n . . • , Y p )) (3.3.25) 

- 2 s(y 1 ,...,Y_ 1 ,[x,y i ],Y + i,...,F p ). 

i—1 

Since V is torsion free, [X, Y. t ] = Va Y — Vy 4 X, and with (3.3.23), we obtain 
(L X S)(Y 1 ,...,Y P ) 

= (Va s)(y 1 ; ...,y p )+ 2 s(y 1 ,...,y i _ 1 ,Vy i x,...,y p ) (3.3.26) 

i = 1 
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For example, for g = g.ijdx 1 (g> dx 3 , we get 

(L x g)(Y, Z) = g(V Y X, Z) + g(Y, X Z X) (3.3.27) 

(= <' V y X,Z) + (Y,X z X )) 

From (3.3.25), we obtain for a p-form uj 

dw(Y 0 ,...,Y p ) = E(-iyL Yi (uj(Y 0 ,...,Yi,---,Y p )) (3.3.28) 

*= o 

+ s (-i) i+ W^i4..,t..j,..Jp), 

and hence 

dco(Y 0 ,...,Y p ) = E(-l yv Y MYo,---,Y t _ 1 ,%Y i+1 ,...,Y p ). (3.3.29) 

i = 0 

Lemma 3.3.4 Let e\ ,ed (d = dim M) be a local orthonormal frame field 
(i.e. e\ (j/), . . . , ed{y) constitute an orthonormal basis ofT y M for all y in some 
open subset of M). Let 77 1 , . . . be the dual coframe field (i.e. ^(e,) = Sf). 
The exterior derivative satisfies 

d = g j A V ej (3.3.30) 

and its adjoint ( cf. Definition 2.1.1) is given by 

d* = -t(e,)V e , (3.3.31) 

where t denotes the interior product ( t : L2 P (M) — * 17 P ^ 1 (M), and for u £ 
fl p (M), Yo , . . . , Y p - 1 € T y M , we Lave 

(t(y 0 )w)(yi, . . . , y„_i) = w(y 0 , y u . . . , y p _ 1 )). (3.3.32) 


Proof. (3.3.30) is the same as (3.3.29). We are going to give a different method 
of proof, however, that does not use the Lie derivative and that also gives 
(3.3.31). 

We put 

d:=r] J A V ej . 

In order to show that d= d, i.e. (3.3.30), we proceed in several steps: 

1) d does not depend on the choice of the frame field ei, . . . , e<j. 

Let fi , . . . , fd be another local frame field, with dual coframe field 
C) • • • , Then 

fj = a k jek (3.3.33) 

for some coefficients a k , and 
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with 



from the standard transformation rules. 

Consequently 

? A V fj = pj/ A V a * ek = a) p\/ A V e , 

= / A V ek . 

d is independent of the choice of frame field, indeed. 

2) Since d does not depend on a choice of frame field either (see Lemma 
1.5.2 and Corollary 1.5.1), it therefore suffices to check (3.3.30) for 
one particular choice of frame field. The independence on the choice 
of frame field of both sides of (3.3.30) will then imply that (3.3.30) 
will hold for any choice of frame field. 

3) We now choose normal coordinates (a: 1 , . . . ,x d ) centered at xq G M 
(Corollary 1.4.2) and the frame field e :j = -// which is orthonormal 
at Xq. Then / = dx k . We are now going to verify (3.3.30) at the 
point xq for those choices of e,- and /. By 2), and since xq € M is 
arbitrary, that suffices. 

At Xq . the center of our normal coordinates, we have 

d 

r = 0; \/_a_dx k = 0 for all j,k (3.3.34) 

6x3 OX K dx3 

(Theorem 1.4.4 and Corollary 3.3.1). 

Since d and d are both linear operators, it also suffices to verify the 
claim on forms of the type ip(y)dx 11 A ... A dx lp . Renumbering indices, 
it even suffices to consider the form 

( p{y)dx 1 A ... A dx p . 

Using (3.3.34), we have at Xq 

d(Lp(xo)dx l A ... A dx p ) = dad A (V_a_ip) (xo)dx 1 A ... A dx p 

K axi 

= ^/-dx^ A dx 1 A . . . dx p 
0X3 

= d{y}{xo)dx 1 A ... A dx p ) 

which is the desired formula. 

In order to verify (3.3.31), we use the same method. 

We put 

d* = -tfoOVe,. 

1) Independence of the choice of frame field: 
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Since both (fj)j=i,... t d and ( e k)k=i constitute an orthonormal ba- 
sis of T y M , the matrix {otfj)j,k=i,...,d of (3.3.33) is orthogonal, i.e. 

a)o^ = S M . 


Thus 


-^(/,)V/, = -i(a£e fc )V a j e< = -a^t(e fc )V e< (3.3.35) 

= ~ ( '(efc)V eic . 

2) By 1), it again suffices to verify (3.3.31) for one particular choice of 
frame field. 

3) We choose normal coordinates centered at xq as before, and e,- = 
£j,ri k = dx k . 

Then again at xq 

d*((p(xo)dx 1 A ... A dx p ) 

= (Jxl ^ Xo> * dxl A --- AdxP 

= (— lY (J^ip)(xo)dx 1 A ... A dxi A ... A dx p 
oxi 

where in the last expression, j only runs from 1 to p. We compare this 
with 

d*(<p(xo)dx 1 A ... A dx p ) 

= (— i) d (p+ 1 )+ 1 * d * (p(x 0 )dx 1 ... A dx p ) by Lemma 2.1.4 
= (-l) d (P+ 1 )+ 1 * d(<p(x 0 )dx p+1 A ... A dx d ) by definition of * 

= (-n^P+b+i * dx j A (V _o_p)(x 0 )dx p+1 A ... A dx d 

by (3.3.30) and (3.3.34) 

= (_ i)d(p+ 1 )+ 1 (_ i)(p-i)(rf-p+ 1 )+(p-l)v^ v dx 1 

9x3 

A ... A dxi A ... A dx d by definition of * 
= (— iy\7 ^(pdx 1 A ... A dxi A ... A dx d . 

9x3 

Thus, d* = d* . □ 

Remarks. 

1) For (3.3.30), we do not need to assume that the frame field is or- 
thonormal. It suffices that the vectors ei(y), . . . , ed(y) constitute a 
basis of T y M. Of course, this is to be expected from the fact that the 
definition of the exterior derivative does not involve a choice of metric. 
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By way of contrast, in (3.3.31) the e 3 have to be orthonormal, and of 
course, the definition of d* does depend on the choice of a metric. 

2) We may now give a proof of formula (2.1.31): 

We recall from formula (3.3.35) that we have for arbitrary (not nec- 
essarily orthonormal) bases of T y M with 

fj = ot k e k 


that 

-*(/,-) V/, = -ayjt(e k )V ek . (3.3.36) 

We now choose (fj)j=i,...,d to be orthonormal and e k = !3 J <: k w.r.t. 
local coordinates. Then of course 

(ek ’ e * } = ( ^’^ }=9ke 

and hence 

Sij = ( fi,fj ) = {a k e k ,a e :j e e ) = a’fa^gu, 

and thus 

a* at = 5 l3 g M . (3.3.37) 

From (3.3.31), (3.3.36), (3.3.37) (since (fj) is orthonormal) 

d ‘ ■ < 33 - 38 > 
Then for a = a^.^^dx 11 A ... A dx lp 


— a ii...i p rj m dx rn A dx n A dx lj A ... A dx tp ) (3.3.39) 

using (3.3.4) and thus 




= -a k U 




• f'p— l 


dx l 


^e k a J i i-- i P- 1 ) 


which is (2.1.31). 


We next want to express the Laplace-Beltrami operator A (cf. Definition 
2.1.2) in terms of the Levi-Civita connection V. For that purpose, we define 
the second covariant derivative as 

V| :y = VxVy-V Vx y. (3.3.40) 


Theorem 3.3.3 (Weitzenbock Formula). Let ei ,...,ed (d = dimM) be a 
local orthonormal frame field as in Lemma 3.3.4, with the dual coframe 
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field 77 1 , . . . ,rj d . Then the Laplace-Beltrami operator acting on p-forms ( p = 
0, 1, . . . , d) is given by 

A = -V^ - rf A i(e i )i?(e i ,e,). (3.3.41) 


Proof. We shall use invariance arguments as in the proof of Lemma 3.3.4. 
The right hand side of (3.3.4) is independent of the choice of our orthonormal 
frame field n*. Therefore, if we want to verify (3.3.41) at an arbitrary point 
xq £ M, we choose normal coordinates centered at Xq and put at xq, 

_ d_ 

6i dx i 


Then, always at Xq, 


hence 



and also [gfr, gfj] = 0, hence 


(3.3.42) 


R{ei, Bj) = V ei V ej - Ve 3 V e . (cf. (3.1.32)). 


(3.3.43) 


Using Lemma 3.3.4, we then have at Xo 

d*d=-i(e J )V ej (r 1 i AV e ,) 

= —i(ej)( rf A V ej .V ei ) since V ej r] 1 = 0 at xq 
= -V et V efc + rf A t( ej )V e .V ei . (3.3.44) 

Next 

dd* = -rf A Ve.We.OVe,) 

= -rf At(e,)V ei V e . (3.3.45) 

since at xo ,t(ej)V ei = V ei i(ej) because of V ek rf = 0. 

(3.3.42) - (3.3.45) imply (3.3.41). □ 

Remark. On functions, i.e. 0-forms /, we have 

R{ei,ej)f = fR(ei,ej) 1 = 0 
because of the tensorial property of R. 

Hence for a function / : M — » R, 

Af = ~V 2 eiei f 


(3.3.46) 
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Definition 3.3.5 The Hessian of a differentiable function f : M 
Riemannian manifold M is 


Vd/. 


on a 


We have df = §^idx l in local coordinates, hence 


df = 


d 2 f 

dxddxJ 


dx' - / 


dx i ' 


jk 


dx k 


Xdf = ( H 'ft) dx i ® dx j . 

J \ dx l dx-i dx k lJ J 

We also have 

Vd/(X, Y) = (Vx grad /, Y), 
since Y (/) = (grad /, Y) and thus 

X(Y(f)) = X( grad /, Y) 

= (Vx grad f,Y) + ( grad /, V X Y) 
= (V A -grad/,y) + (V A -F)(/), 

and applying (3.3.47) to X and Y yields 

Vdf(X,Y) = X(Y(f)) - (V x Y)(f). 


(3.3.47) 

(3.3.48) 


(3.3.49) 


This formula can be given the following geometric interpretation: Let X £ 
T p M and take a geodesic c : [0, e) — » M (for some e > 0) with c(0) = p, 
c(0) = X. Then at p 


Vdf(X,X) = ^f(c(t)) h = Q 


(3.3.50) 


Namely 


and 


X{X{f )) = c(grad/ (p ) , c) 

= 6 (s /Wt)) i« = o) 

= |j/W<))i ( = o 
V 6 c = 0, 


since c is geodesic (see (3.1.34) and Corollary 3.3.1) so that (3.3.50) follows 
from (3.3.49). 


Definition 3.3.6 The differentiable function f : M — > M is called (strictly) 
convex if the Hessian Vd/ is positive semidefinite (definite). 
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Theorem 3.3.4 Let M be a compact Riemannian manifold with metric ten- 
sor g. There then exists a constant c (depending on the geometry of M) such 
that for any (smooth) vector field X on M 


J \\S7 X\\ 2 dvol + J | div X\ 2 dvol — C (J II A|| 2 (i?;ol + J \\Lxg\\ 2 dvol ), 

M MM 

(3.3.51) 

where Lxg is the Lie derivative of g in the direction of X (see (1.6.20)). 


Proof. In local coordinates, by (1.6.20), 


Thus, 


Lxg = ( 9k j 


dX k 

dx 1 


dX k 
+ 9ik frcf 


gij t kX k )dx l (g) dx i 


(3-3, 2 , 

where, here and in the sequel, P(X,XX) stands for any terms that are 
bounded by 

const (||A||||VX|| + ||X|| 2 ). 

Now 

dx k dx i _ d k dx* ,dx k ox k dx* 

dx l dx k dx 1 dx k dx k dx k dx 1 

Also 

k £)~Y"m 

||VA|| 2 = 9kmg U -g- -foj- + P(X,XX), (3.3.54) 

|divA| 2 = ^ F |- + P(A,VA). (3.3.55) 

From (3.3.52) - (3.3.55), 

J || VAC || 2 + J\dWX\ 2 <\J \\L x g\\ 2 + J P(X,VX) (3.3.56) 

Using the inequality 

imi IIVATII < |||VAC|| 2 + |||AC|| 2 for any 5 > 0, 
we can estimate 

j P(X, XX)<e j || VA|| 2 + c(e) J ||X|| 2 , (3.3.57) 


where c{e) depends on e > 0 and on the constants involved in the terms 
P(X,VX), i.e. on bounds for the metric tensor g and its first derivatives. 
Using (3.3.57) with e = \ in (3.3.56), we easily obtain (3.3.51). □ 
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Corollary 3.3.2 Let M be a compact Riemannian manifold. Then the vector 
space of Killing fields (cf. Def. 1.6.7) on M is finite dimensional. 

Proof. By definition of a Killing field X, 

L x g = o. 

Inserting this into (3.3.51), we obtain 

J ||VX|| 2 + J | div A| 2 < c J ||X|| 2 . (3.3.58) 

MM M 

If (X n ) n6 N then is a sequence of Killing fields with /||X„|| 2 = 1 for all 
n, we bound their Sobolev H l ,2 -norm by (3.3.58), apply Rellich’s theorem 
(Theorem A. 1.8 in the Appendix), and conclude that the X n contain a 
subsequence that converges in L 2 . This implies that the space of Killing 
fields is a finite dimensional subspace of the space of L 2 -vector fields on M. 

□ 


Perspectives. The sectional curvature as an invariant of a Riemannian metric 
was introduced by Riemann in his habilitation address (quoted in the Perspectives 
on 1.1). The tensor calculus for Riemannian manifolds was developed by Christoffel, 
Ricci, and others. It also played an important role in the development of Einstein’s 
theory of general relativity. 

Levi-Civita introduced the notion of parallel transport for a Riemannian man- 
ifold. (Similar concepts were also developed by other mathematicians at about the 
time.) The concept was expanded and clarified by Weyl, see [253]. For a historical 
account, see also [219, 220]. 

Space forms are quotients of the sphere S n , Euclidean space R n , or hyperbolic 
space H n (see §4.4). They can be classified, cf. Wolf[258]. 

Einstein manifolds form an important class of Riemannian manifolds. Every two 
dimensional manifold carries a metric of constant curvature, i.e. is a space form, 
by the uniformization theorem. In higher dimensions, some necessary topological 
conditions have been found for the existence of Einstein metrics. The question which 
manifolds admit Einstein metrics is far from being solved. Even in three dimensions 
where a metric is Einstein if and only if it has constant sectional curvature, the 
question is not yet fully solved. See however [243], [244]. A comprehensive account 
of Einstein manifolds is given in the monograph [19]. 

Theorem 3.3.4 is a Riemannian version of Korn’s inequality. This result, and the 
proof of Cor. 3.3.2 given here, are taken from [48]. One may also identify the terms 
P(X, VX) in (3.3.52) in terms of the Ricci curvature to obtain the Bochner-Yano 
formula, see [25]. 
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3.4 Connections for Spin Structures and the Dirac 
Operator 

Let V be the Levi-Civita connection of the oriented manifold M of dimension 
n, according to Theorem 3.3.1. By Lemma 3.2.2, it admits a local decompo- 
sition 

X7 = d+A (3.4.1) 

with A £ f2 1 (AdTM), i.e. a one form with values in s o(n) that transforms 
according to (3.1.17). Conversely, given a vector bundle E with bundle metric 
(•,•) on which SO(?i) acts by isometries, and a one form A with values in 
so (n) that transforms by (3.1.17), then (3.4.1) can be used to define a metric 
connection on E according to the discussion in §3.2. Consequently, for any 
such bundle E on which SO(n) acts with the same transition functions as for 
the action on TM , the Levi-Civita connection induces a connection. Applying 
this observation to the Clifford bundles C1(P) and Cl c (P) from Definition 
1.8.11, we conclude that the Levi-Civita connection induces a connection, 
again denoted by V, on each Clifford bundle. 

Lemma 3.4.1 For smooth sections /i, v o/Cl(P) (or Cl c (P) y ) we have 

V(zii') = V(/i)^ + id'V(h') (3.4.2) 


Proof. It is clear that the exterior derivative d satisfies the product rule, and 
we recall that A in the decomposition (3.4.1) is in so(n), i.e. acts by the 
infinitesimal version of the SO(rc) action on C1(P). Since this SO(n) action 
extends to the one on the tangent bundle TM, B £ SO(n) acts via 

B(pu) = B(p)B(v), (3.4.3) 

and differentiating (3.4.3) yields the product rule for A. □ 

Corollary 3.4.1 V leaves the decomposition of the Clifford bundles into el- 
ements of even and odd degree invariant. 

Proof. It is clear from the definition, that subbundles of degree 0 and 1 are 
preserved, and the claim then easily follows from (3.4.2). □ 

Since the chirality operator P of Definition 1.8.3 defines a section of 
C1 C (P) that is invariant under the action of SO(n), it must be covariantly 
constant, i.e. 

Lemma 3.4.2 V(P) =0 □ 

Similarly, since the Lie algebra spin(n) can be identified with so(?r) (see 
Lemma 1.8.2), in the case of a spin structure P over M (cf. Definition 1.8.7), 
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we may use the same procedure to obtain induced connections on the asso- 
ciated spinor bundles. We denote them again by V. The action of Cl c (P) on 
the spinor bundle S„ via Clifford multiplication on each fiber (see (1.8.24)) 
is compatible with these connections; more precisely 

Lemma 3.4.3 For smooth sections p ofCl c (P), a of S„ 

V(/nr) = V(^r)cr + ^tV(cr) (3.4.4) 

(where the products of course are given by Clifford multiplication.) 

Proof. Similar to the one of Lemma 3.4.1. □ 


Suppose that in a local trivialization of TM, A from (3.4.1) is given by 
the (skew symmetric) matrix flij . We write 

A — ^ A Cj , 

i<j 

where e* Ae,,- denotes the matrix with (—1) at the place (i, j), +1 at (j, i), and 
0 otherwise. According to Lemma 1.8.3, e, A ej in so(n) corresponds to \eiej 
in spin(n). Thus, the connection on the spinor bundle w.r.t. the induced local 
trivialization is given by 

d + 2 (3.4.6) 

i<j 

Here, e^j of course operates by Clifford multiplication on spinors. 

We next consider the case of a spin c structure P over M (cf. Definition 
1.8.9). Here, the Levi-Civita connection V does not suffice to determine a 
unique connection on bundles on which Spin c acts. Namely, since the Lie 
algebra of Spin c (n) is spin(n) © u(l), we need to specify in addition a con- 
nection on the u(l) part, i.e. on the determinant line bundle L of the spin 0 
structure (Def 1.8.10). We identify the Lie algebra u(l) of U(l) with ?'R, and 
thus, a unitary connection on L is locally represented by a function iA with 
imaginary values. Given the Levi-Civita connection and such a connection 
on L, we represent the induced spin 0 connection Vyt locally as 


Va = d - 


r, j 'y ' ^ij e F 


iA 


(3.4.7) 


. i<3 


as in (3.4.6). 
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Definition ji.4.1 

(i) Let P — > M be a spin structure on the oriented Riemannian manifold 
M, with Levi-Civita connection V as explained above. The Dirac 
operator V operates on sections a of the spinor bundle §„ via 

Da(x) = ef\7 ei (a)(x) (3.4.8) 

where e,, i = 1 ,...,n, is an orthonormal basis of T X M (x £ M). 
The product on the right hand side of (3.4.8) is given by Clifford 
multiplication. 

(ii) Let P c — » M be a spin c structure on M, and let A represent a unitary 
connection on the associated determinant line bundle L. The Dirac 
operator Da operating on S n is given by 

V A a(x) = e,V/i,e,(ff)(i) 


Remark. Since V also operates on the Clifford space C1(V), = A*(V) as 
a vector space, we can also define a Dirac operator on the Clifford bundle 
instead of the spinor bundle, namely, 

D := d + d*, with d = rjj A V ej , d* = — i(ej)V ej 
as in Lemma 3.3.4 that then satisfies 

D 2 = A, the Laplacian. 

These two Dirac operators should not be confused. 


Lemma 3.4.4 The Dirac operators V and V A do not depend on the choice 
of an orthonormal frame e* . 


Proof. Any other such frame fj,j = l,...,n, can be obtained as 

Jj = bijCi 

for some B = (&ij)j>*=l,— e O(n). Then 


/j'V/j — bjie i^bjke k 

V ei 


bjibjkefS/ e k ek 

since B £ O(n) 


which is the invariance of D, and the same computation works for D A - □ 



3.4 Connections for Spin Structures and the Dirac Operator 151 


A more abstract way to express the Dirac operator is the following. Let 

cl : TM ® S — » S 

v (gi a — > v ■ a 

denote the Clifford multiplication. Thus, tangent vectors of M act on spinors 
by Clifford multiplication. Denote the space of smooth sections of a vector 
bundle E over M by r(E). Then 

V = cl o V : T(S) ^ r{T*M <g> §) £* E{TM <g> §) 4 T(§) 

where the identification between r(T*M £g> §) and r(TM (g> §) uses the Rie- 
mannian metric of M. 

Lemma 3.4.5 Let M be even dimensional, and let be the half spinor 
bundles for a spin or a spin c structure on M . Then the Dirac operator V 
(Da) maps T (S^) to r (§J). 

Proof. By Corollary 3.4.1, V, and similarly V a, leaves the decomposition into 
sections of even and odd degree invariant, while Clifford multiplication by e* 
interchanges sections of even and odd degree. □ 

We recall from Corollary 1.8.3 that on the bundle S„ of spinors, we have 
a pointwise Hermitian product (•,•) (invariant under Spin(n)). We suppose 
now that M is compact. We may then form the associated L 2 product 

(eri,cr 2 ) := J (<ri(ar), a 2 (x)) * (1) 

M 

where *(1) is the volume form of M (see 2.1.18)). 

Lemma 3.4.6 Let M be a compact Riemannian manifold with a spin struc- 
ture. Then the corresponding Dirac operator V is formally self adjoint, i.e. 

[Dai, a 2 ) = (ai,Va 2 ) (3.4.9) 


for all spinor fields <j\, 02 ■ 

Proof. Let x € M, and choose normal coordinates centered at x. With e* := 
j)|t, we then have at x 

= 0 for all i,j (cf. Theorem 1.4.4 and Corollary 3.3.1) (3.4.10) 
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We then have 

(Vo 1 (x),o 2 (x)) = {e i V ei <7 1 (x),a 2 (x)) 

= —(V ei cri(x), ei<j 2 (x )) since (•, •) is invariant under Clif- 
ford multiplication by the unit 
vector e, 

= -e i (o 1 (x),e i o 2 (x)) + (cr 1 (x),V ei (e i a 2 )(x)) 

since V is a metric connection 
= -ei(ai(x),eicr 2 (x)) + (oi(x) , erf ei o 2 (x)) by (3.4.10) 

= —ei(ai(x), eia 2 {x)) + (o 1 (x),Vo 2 (x)). 

We now consider V 1 = (oi(x), ei 0 2 (x)) as the i th component of a vector field 
V (in fact V is a complexified vector field, i.e. a section of TM (g> C). The 
preceding formula then becomes 

(Vo\(x),o 2 (x)) = — divF(a:) + (<ti (x),Va 2 (x)). (3.4.11) 

Since all terms in (3.4.11) are independent of the particular choice of coordi- 
nates, they continue to hold regardless of whether (3.4.10) is satisfied. (This 
point has been discussed in §3.3, e.g. in the derivation of Lemma 3.3.4, but 
since this an important computational trick, we repeat it here). Since 

J dhW(;r) * (1) = 0 

M 

by the Gauss theorem (see the discussion in §2.1), (3.4.9) follows by integrat- 
ing (3.4.11). □ 

Corollary 3.4.2 On a compact spin manifold M, Vo = 0 for a spinor field 
iff V 2 o = 0. 

Proof. This follows from 

(V 2 o, o) = ( Vo , Vo) 

by Lemma 3.4.6. □ 

Definition 3.4.2 A spinor field satisfying Vo = 0 is called harmonic. 

We now come to Weitzenbock formulas that constitute analogues of The- 
orem 3.3.3. 

Theorem 3.4.1 Let M he a spin manifold with a local orthonormal frame 
field ei, . . . , e n (as in Lemma 3.3.4, n = dim M ). Then the Dirac operator V 
satisfies 
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v 2 = -v;U 


4*’ 


where R is the scalar curvature of M. 


(3.4.13) 


Proof. As in (3.4.10), we assume 


V 6i (e.j) = 0 at the point x € M under con- (3.4.14) 

sideration, for all i, j, 

as well as 

[ei,ej]=0 since this holds for all coordinate (3.4.15) 

vector fields e, = 

We compute, for a spinor field cr, at x, 


V 2 a = ejV ej ((eiV ei ) a) = e^V 6j V ei u by (3.4.14) 

= -V ei V ei c r 

i<j 

because of = — 28ij 

= ^^ 7 e i e i tT + X] e j e «-^( e j> e *)cr (3.4.16) 

i<j 

by (3.3.42) and where i?(-, •) 
is the curvature tensor of 
the Levi-Civita connection 
V and where we have used 
Theorem 3.1.2 and (3.4.15) 


R(ej, ef) here acts on spinor fields, and if we express this operator w.r.t. our 
local frame field e k , we obtain a factor | as in (3.4.6), coming from Lemma 


1.8.3: 


R{&i ? Oj ) 


e.j)e k , &i)e k ei, 

1 k<l 


(3.4.17) 


where e^e; again operates by Clifford multiplication. In order to derive 
(3.4.13) from (3.4.16), it thus remains to evaluate 


5 ej)e k , ei)eieje k ei = t ^ (R(ei,ej)e k ,ei)eieje k ei (3.4.18) 

j<i k<l 


If i , j, k are all distinct, 


GiGjGk — GjGj^Gi — 

and the first Bianchi identity (see Lemma 3.3.1) implies in this case that 
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The remaining terms are 

g ^ ' ( (.R{pi , &k) &k , C T (R(.&k, e ? ;)efc, C-l) C k CiC k Cl) 

i,j,k,l 

= - ^^2(R{ei,e k )e k ,ei}eiei by (3.1.33) and e| = -1 

i,k,l 

= where Ru is the Ricci tensor 

= -Ru since Ru = Ru (see (3.3.19)), + e;ej = -2 S X j 



□ 

Theorem 3.4.2 Let M be a spin c manifold with a local orthonormal frame 
field e\, ... ,e n and a spin c connection V^. The Dirac operator T>a satisfies 

V 2 A = -V\,e iei + \R+\FA (3.4.19) 

where Fa, an imaginary valued two-form, is the curvature of A. (Fa acts on 
spinors by Clifford multiplication; in our frame field, JJ F 'A,ij e i A becomes 

i<j 

| JJ FA,ijCiej as usual.) 

i<j 

Proof. The proof is the same as the one of Theorem 3.4.1, except for the 
additional u(l) part A of the connection that leads to the additional Fa in 
the formula. □ 


Perspectives. See the references given in the perspectives on §1.8. The Dirac 
operator on the spinor bundle (Definition 3.4.1) was introduced by Atiyah and 
Singer [8] in their investigation of the index of elliptic operators. The simpler Dirac 
operator on the Clifford bundle had been studied earlier by Kahler[152] 


3.5 The Bochner Method 

Lemma 3.5.1 Let (ej)i=i,...,d be a local orthonormal frame field on M, with 
dual coframe field as in Lemma 3.3.6. 

If to is a harmonic form, then 

—A(lv,oj) =2(V e ,w, V ei <x>) 

— 2 (w, 77 * A i(ej)R(ei, ej)co) . 


(3.5.1) 
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Proof. Let Xq be a point in M where we perform the computations, and 
choose normal coordinates centered at £0 and ej = . Again, the formulae 

will not depend on the choice of a local orthonormal frame. Then, by the 
remark after Theorem 3.3.3 and (3.3.42) 

-A{uj,lu) = V ei V ei 

= 2(V ei <x’, V ei w) + 2 (lo, V ei W ei u>) . (3.5.2) 

(3.3.42) and (3.3.41) then yield (3.5.1), since Alo = 0 by assumption. □ 

Lemma 3.5.2 With the notation of Lemma 3.5.1, we have for a harmonic 
1-form u) on M 

— A(uj,u>) = 2|Vw| 2 + 2 Ric(u>, u>) (3.5.3) 

with |Vw| 2 := (V ei u>, V ei w) and writing u) = firf, 

Ric(u, u>) := Ric(fiei, fjef) = fcfj Ric(ei,ej). 


Proof. We compute the curvature term in (3.5.1) for a 1-form u> : 

(u>, rf A b(ej)R(ei, ej)uf) 

= {ferf, rf A i(ej)R(ei, ej)f k r] k ) 

= -ftfkiv^V 1 A L(ej)Rkmijri m ) 

= —fefkiv*! Rkjijrf) 
f tf k Rkjc, 

= — fcfkRu 
= — Ric (cu,u>) 

where we have used the tensor notation of 3.3, (e.g. (3.3.6) and (3.3.18)). □ 

Theorem 3.5.1 (Bochner) 

(i) Let M he a compact Riemannian manifold with nonnegative Ricci 
curvature. Then every harmonic 1-form u is parallel (i.e. Vw = 0^. 
In particular, the first de Rham cohomology group satisfies 

dim H^ r (M,W) < d(= dirnA-f). 

(ii) If M is a compact Riemannian manifold of positive Ricci curvature, 
then M has no nontrivial harmonic 1-form. Thus, 

hUm,R) = { 0 }. 


Proof. We integrate formula (3.5.3): Then from (2.1.17) 
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0 = - J A(ui, to) * (1) = 2 J (|Vw| 2 + Ric(w,w)) * (1). (3.5.4) 

M M 

By our assumption, the integrand on the right hand side is pointwise non- 
negative. It therefore has to vanish identically. This implies in particular 

Vw = 0, (3.5.5) 


and u) is parallel. 

A parallel 1-form is determined by its value at one point of M (cf. the 
discussion before Definition 3.1.2). 

Therefore, the dimension of the vector space of parallel 1-forms is at most 
the dimension of the cotangent space T*M, i.e. d. Likewise, (3.5.4) implies 

Ric(w,w)=0. (3.5.6) 

Thus, if M has positive Ricci curvature, we must have wsO. □ 

Remark. In (ii) of the preceding theorem, it suffices to assume that M has 
nonnegative Ricci curvature, and that there exists some point Xq where the 
Ricci curvature is positive. Namely, from 

Ric(w, to) = 0, 

we then conclude that, co(xo) = 0, and since u> is parallel, it then vanishes 
everywhere. 

Below, we shall derive a stronger result (Corollary 4.3.1, Theorem of 
Bonnet-Myers) on the topology of Riemannian manifolds of positive Ricci 
curvature by a different method. Nevertheless, the Bochner method is an im- 
portant tool in Riemannian geometry because it has a rather general range 
of applicability. It also applies to harmonic sections of bundles (suitably de- 
fined), harmonic mappings (see chapter 8) etc. The harmonicity of the object 
under consideration will imply a formula of the type of (3.5.1). The essential 
point of (3.5.1) is that instead of third order derivatives that would appear 
for a general, nonharmonic object, one only has a commutator term given by 
a curvature expression. The other term on the right hand side is a square, 
hence nonnegative. If one then assumes that the curvature is such that the 
curvature term also is nonnegative, both terms have to vanish identically, be- 
cause the integral of the left hand side vanishes. The vanishing of the square 
term then implies that the object is parallel. If the curvature is even positive, 
the vanishing of the curvature term implies that the object itself vanishes. 

We shall see another instance of the Bochner method in 8.7. 

When combining the preceding reasoning with the Weitzenbock formula 
of Theorem 3.4.1, we get 
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Theorem 3.5.2 (Lichnerowicz) Let M be a compact spin manifold. If M 
has nonnegative scalar curvature, then every harmonic spinor field is parallel. 
If the scalar curvature is positive, then every harmonic spinor field vanishes. 

Proof. As in the proof of Lemma 3.5.1, we compute for a harmonic spinor 
field cr 

-A(a, a) = 2(V ei cr, V 6i cr) + 2(<r, V^.cr) 

= 2 (V ei a, V ei <r) + l -R(a, a) by (3.4.13). 

As in the proof of Theorem 3.5.1, we integrate this formula to get 

2 J (V ei cr, V ei a) * (1 )+\J R(<r, °) * (1) = 0 

If R > 0, both integrands have to vanish identically; in particular V ei = 0 
meaning that cr is parallel. If R > 0, a = 0. □ 


Perspectives. Further applications of the Bochner method may be found in the 
monograph [259]. See also the Perspectives on 8.7. 


3.6 The Geometry of Submanifolds. Minimal 
Submanifolds 


Let M be an m-dimensional submanifold of the n-dimensional Riemannian 
manifold N. The metric (., .) on N induces a metric on M, as described in 
1.4. The question arises how to compute the Levi-Civita connection V M of 
M from the one on N, V ;V . 

Theorem 3.6.1 We have 

Vfy=(V^y) T for X, Y e r(TM), (3.6.1) 

where T : T X N — » T X M for x £ M denotes the orthogonal projection. 

Proof. In order that the right hand side of (3.6.1) is defined, we have to 
extend X and Y locally to a neighbourhood of M in N. This is most easily 
done in local coordinates around x £ M that locally map M to R m C R". 
The extension of X = C‘( x ) J J r - then for example is 

X(x 1 ,...,x n ) = S ff{x 1 ,...,x n )^-. 

i = 1 ox 1 
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We then have 

(X,Y){x) = (X,Y){x) 

[X,Y](x) = [X,Y](x). 

Since (3.3.1) has to hold for V M as well as for X N , (3.6.1) follows. (It follows 
from the representation of V w by Christoffel symbols, that (V^F) T does 
not depend on the chosen extensions. It is also clear that (V xY) T defines a 
torsion free connection on M because X N is a torsion free connection on M, 
and since XyY — XyX — [X, Y] vanishes, also the part of this expression 
that is tangential to M has to vanish.) □ 

With the help of Theorem 3.6.1, we may easily determine the Levi-Civita 
connection of S n C R n+1 . 

Let v(x) be a vector field in a neighborhood of x 0 € M C N, that is 
orthogonal to M, i.e. 


{is(x),X)=0 for all X &T X M. (3.6.2) 

We denote the orthogonal complement of T X M in T X N T X M^. The bundle 
TM 1 - with fiber T X M^~ at x € M is called normal bundle of M in N. (3.6.2) 
thus means 

is(x) € TxM 1 - . 

Lemma 3.6.1 (V(^) T (:r) only depends on v(x), the value of is at x. 

Proof. For a real valued function / in a neighborhood of x 

(VxM T (2) = ( X(f)(x)u(x)) T + f(x)(\7 x v) T {x) 

= /W(V x i/) T (a;), 

since v{x) £ T X M - 1 . □ 

Lemma 3.6.1 makes the following definition possible. 


Definition 3.6.1 The second fundamental tensor of M at the point x is the 
map 

S : T X M x TxM 1 - -> T X M , (3.6.3) 

defined by S(X,is) = (Vfr) T . 

Lemma 3.6.2 For I,Y € T X M 1 

e„{X,Y):=(S{X,v),Y) (3.6.4) 


is symmetric in X and Y. 
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Proof. 

MX,Y) = <(V» T ,F) 

= <V%v,Y), 

= —(i/jV^Y), 

= -{i/,v$x + [x, y]) 

= -(u,V$X) t 

= (X?v,X), 

= <(V^) T ,X), 

= UY,X). 

Definition 3.6.2 £„(■,■) is called th 
N. 


since Y £ T X M 

since {v,Y) = 0 and X N is metric 
since V N is torsionsfree (3.6.5) 
since [X, Y] £ T x M,v £ TxM 1 - 
since (v, X ) = 0 and V s is metric 
since X £ T X M 

□ 

i second, fundamental form of M w.r.t. 


Remark. The first fundamental form is the metric, applied to X and Y £ 
T X M, i.e. (X, Y). 

For a fixed normal field v, we write S V (X) = S(X,is). S v : T X M — > 
T X M then is selfadjoint w.r.t. the metric (., .), by Lemma 3.6.2. Suppose now 
{v,v) = 1; i.e. v is a unit normal field. The m eigenvalues of S v which are 
all real by self adjointness are called the principal curvatures of M in the 
direction v. and the corresponding eigenvectors are called principal curvature 
vectors. 

The mean curvature of M in the direction v is 

H u := — trSV. 
m 

The Gauss-Kronecker curvature of M in the direction v is 

K v := det S v . 

For an orthonormal basis ei, ... , e m of T X M , 

K v = det (l v (ei,ej)). 


We now consider the case where M has codimension 1, i.e. n = m+ 1. In 
this case, for each x £ M, there are precisely two normal vectors v £ T X M L 
with (u, v) = 1. We locally fix such a normal field and drop the subscript v. 
If we would choose the opposite normal field instead, t and S would change 
their sign, and the mean curvature M as well. For even to, however, the 
Gauss-Kronecker curvature does not depend on the choice of the direction of 
v. 

Furthermore, because of {v, v) = 1 is always tangential to M, and 

geometrically, it measures the “tilting velocity” with which v is tilted (relative 
to a fixed parallel vector field in N) when moving on M in the direction X. 
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We now want to compare the curvature tensors of M and N, R M and R N . 
It turns out that their difference is given by the second fundamental tensor; 
namely 

Theorem 3.6.2 (Gauss equations). Let M be a submanifold of the Rieman- 
nian manifold N, in = dim M, n = dim N, k = n — m,x € M, v\,...,Vk an 
orthonormal basis for (T X M) X , S a := S Va , £ a := i„ a (a = 1, . . . , k). With the 
convention that a Greek minuscule occuring twice is summed from 1 to k, for 
X , Y,Z,W e T X M 

R m (X, Y)Z - (. R n (X , Y)Z) t ( 3 . 6 . 6 ) 

= e a (Y,z)s a x -£ a (x, z)s a Y 

and hence also 

(R m (X, Y)Z, W) - (. R n (X , Y)Z, W) ( 3 . 6 . 7 ) 

= £ a (Y, Z)£ a (X, W) - £ a (X, Z)£ a (Y, W) 


Proof. Since everything is tensorial, we extend X. Y. Z, IT, to vector 

fields in TM and TM ± , resp., with the v a always being orthonormal. 

V¥Z = (V$Z) T + {V?Z) X =V¥Z+(v a ,V$Z) v a , 

since the v a form an orthonormal basis of T M x . 

Hence 


= X N x X™Z + X{{v a , X%Z))v a + (v a , X?Z)V%v a , 


i.e. 


(V^V^Z) T 


{v a ,V$Z)(V%v a ) T 
V™V™Z-£ a (Y,Z)S a (X) by (3.6.5) 


Analogously 


(' Vf V%Z) T = Vf ' Vf Z - £ a (X, Z) S a (Y ) . 


(3.6.8) 


(3.6.9) 


Moreover, 

(Vf X Y] Z) T = V^ XtY] Z by Theorem 3.6.1. (3.6.10) 

(3.6.6) follows from (3.6.8) - (3.6.10), and (3.6.7) follows from (3.6.6). □ 


The “theorema egregium” of Gauss is the following special case of Theo- 
rem 3.6.2: 


Corollary 3.6.2 For a surface M in R 3 (i.e. m = 2 ,n = 3) the Gauss 
curvature, defined as the determinant of the second fundamental form, hence 
defined through the embedding of M in R 3 , coincides with the Riemannian 
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curvature of M which is determined by the metric, hence independent of the 
embedding. Thus, the Gauss curvature does not depend on the embedding of 
M into R 3 either. □ 

Definition 3.6.3 A Riemannian submanifold M of a Riemannian manifold 
N is called totally geodesic if all geodesics in M are also geodesics in N. 

Theorem 3.6.3 M is totally geodesic in N if and only if all second funda- 
mental forms of M vanish identically. 

Proof. Let c : / — > M be geodesic in M, i.e. V^c = 0. Because of (VA c) T = 
(Theorem 3.6.1), c is geodesic in N if and only if (V^ c) 3- = 0, i.e. 

(Vf c, v) = 0 for all v G TM ± . 

Now 

(V^ c, v) = — (c, v), since (c, v) = 0 and V w is metric 

= -£ v (c, c). 

The claim directly follows. □ 

For example, each closed geodesic in a Riemannian manifold defines a 
1-dimensional compact totally geodesic submanifold. 

The totally geodesic submanifolds of Euclidean space are precisely the 
affine linear subspaces (and their open subsets). The closed totally geodesic 
subspaces of the sphere S n C R™ +1 are precisely the intersections of S n with 
linear subspaces of R™ +1 , hence spheres themselves. This follows directly from 
the description of the geodesics on S n in 1.4. A generic Riemannian manifold, 
however, does not have any totally geodesic submanifolds of dimension > 1. 

We want to briefly discuss a global aspect. 

Let M be an oriented submanifold of the oriented Riemannian manifold 
N. This means that M itself is an oriented manifold whose orientation coin- 
cides with the one induced by N. If thus for x G M e\, . . . , e„ is a positive 
basis of T X N for which e±, . . . , e m are tangential to M, then e\, . . . , e m con- 
stitute a positive basis of T X M. 

If under this assumption, we have n = m + 1, we may also determine the 
sign of the unit normal field v by requiring that if e\, . . . ,e m is a positive 
basis of T X M , then ei, . . . , e m , v is a positive basis of T X N. Suppose now that 
N = R ra , i.e. that M is an oriented hypersurface of R". Let p : TR" — > R" 
map each fiber of TR™ isomorphically onto R™, in the usual canonical manner, 
i.e. by parallel transport into the origin. 

Definition 3.6.4 po v : M — > S'™ -1 is called the Gauss map of M. 

The Gauss-Kronecker curvature, i.e. the Jacobian of du(x) : T X M —> 
T X M, then becomes the Jacobian of the Gauss map. It thus measures the 
infinitesimal volume distortion of M by the Gauss map. 
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Theorem 3.6.2 allows an easy computation of the curvature of the sphere 
S n C R" +1 . Namely, for x = (x 1 , . . . , x n+1 ) € S n , a unit normal vector v(x) 
is given by 

, s id 

-M = * a*- 


Furthermore, 


V R e +1 ^(x) 


d 

dxi 



0 

dxi 


Since we have already seen that the isometry group of S n operates transitively 
on S n , we may consider w.l.o.g. the north pole (0, 0, ... , 0, 1). g|r , . . . , 
are tangential to S n at this point. It follows that 


and 


We conclude 


S( 'Ox ^ dxi 


d , ( , d 


d 

dxi 


for j = 1 , . . . , n 


, d d 
' dxi ’ Q x k 


) = ( 


d d 
dxi ’ g x k 


= 6 


jk- 


' dx i ’ dxi ^ Q x k ’ ' 

In particular, the sectional curvature is 1. 
We also obtain the formula 


S :l kSu i i k / ■ 


(3.6.11) 


R sn (X,Y)Z= (Y, Z)X - (X, Z)Y. (3.6.12) 


We want to consider a particular class of submanifolds in more detail, namely 
those that are critical points of the volume functional. 

Let M be an m-dimensional submanifold of N, with frame ei,...,e m , 
coframe 77 , . . . ,rj m and volume form fj as before, and let 


<P : M -> M 

be a diffeomorphism. Let ei, . . . , e m be a frame on M, rj the volume form. 
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Then 


Vol(M) = 

/* 

— 

f $*v 


J 

M 


J 

M 


0*?) 1 A ... A &*fj n 


M 


j |#*ei A ... A ^*e m |?7 

M 

J (4>*ei A ... A <£*e m ,<£*ei A ... A 

M 


(3.6.14) 


We now consider a more special situation. We define a local variation of M 
to be a smooth map 


F : M x (-£,£)-> TV (e > 0) 


with 


supp F := {x G M : F(x,t) ^ x for some t G (—£,£)} (3.6.15) 

being a compact subset of M and 

F(x, 0) = x for all x G M. 

For small enough |f|,<£ t (-) := F(-,t) then is a diffeomorphism from M onto 
a submanifold M t of A/”, by the implicit function theorem. We assume that 
£ > 0 is chosen so small, that this is the case for all t G (—£,£). Since the 
subsequent computations are local, we also assume that {x G M : F(x,t) yf 
x} is orientable and that e \, . . . , e m is a positively oriented orthonormal basis. 
The variation of volume then is (by (3.6.14)) 

^Vol(* t (M))| t=0 

dr 

= ~j~ t / (&t*e i A ... A $t*em,$t*e i A ... A $ t *e m )^ r)\ t= o 

M 

_ ™ f (@t*ei A . . . A §- t ^t*e a A . . . A A . . . A & t *e m ) 

a=i J \^t^ e i A . . . A 4> t *e m \ 

M 

Putting 

9 
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we obtain 

— Vol(# t (M))] t=0 

m /' (e i A ... A Vf X A ... A e m , ei A . . . A e m ) 

I — fj 

a=u |ei A . . . A e m | 

M 

Namely, if c a (s) is a curve on M with c Q (0) = x,c' a ( 0) = e a , and c a (s,t) = 
^t(c Q (s)), then 

C|s— 0 

and 

(9 9 9 3 9 

9^* e “' f =° = ^c a (s,f)| s=t=0 = ^c a (s,%= t =o 

= V|X| S=0 = V£*. 

Therefore, 

|vol(^(M)), t=0 = j(\7lX,e a ) V 

M 

= J{e a (X,e a )-(X,V^e a )} V . (3.6.16) 

M 

Now e a (X,e a ) = divA T , and since X vanishes outside a compact subset of 
M (see(3.6.15)), we have by Gauss’ theorem 


^ €(%) 0. 

M 

As in the proof of Lemma 3.3.4 (3), we may assume that at the point under 
consideration 

V£e a = 0. 

We then obtain from (3.6.16) 

j\o\ (<P t (M)) |i=0 = - j(X\vle a )- V . (3.6.17) 

M 


We conclude 

Theorem 3.6.4 A submanifold M of the Riemannian manifold N is a crit- 
ical point of the volume function, i.e. 


|vol(^(M))| t=0 = 0 


(3.6.18) 
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for all local variations of M if and only if the mean curvature H v of M 
vanishes for all normal directions v. 


Proof. We choose an orthonormal basis is\, (k = n — m) of T X M L for 
x € M and write 




(3.6.19) 


Then 

(A x , Vf Q e a ) = e tr S Vj = rnf J H Vj . (3.6.20) 

Since every section X of TM 1 - over M with compact support on M defines 
a local variation 

F(x,t) := exp x tX(x) 


of M, (3.6.18) holds if and only if (3.6.20) vanishes for all choices of and 
the conclusion follows. □ 


Definition 3.6.5 A submanifold M of the Riemannian manifold N is called 
minimal if its mean curvature iJ„ vanishes for all normal directions v. 


We want to consider a somewhat more general situation. We let M and 
N be Riemannian manifolds of dimension to and n, resp., and we let 


/ :M -> N 


be an isometric immersion. This means that for each p £ M, there exists a 
neighborhood U for which 

f'-U^ f(U) 

is an isometry (/([/) is equipped with the metric induced from N). The 
point here is that f(M) need not be an embedded submanifold of N but 
may have self-intersections or may even be dense in N. We may then define 
local variations F(x,t) : M —> N with F(x, 0) = f(x) as before, and f(M) is 
critical for the volume functional if and only if its mean curvature vanishes, 
in the sense that for all U as above, f(U) has vanishing mean curvature in 
all normal directions. Such an f(M ) then is called an immersed minimal 
submanifold of N. We now want to write the condition for the vanishing of 
the mean curvature, namely 

(Vf^)- 1 = 0 (3.6.21) 


in terms of /. 

For that purpose, we introduce normal coordinates at the point x £ M 
under consideration, i.e. at x 


{ dx<*' dxP* ~ 5aP 

V M 9 tAj = o for a, (3 = 1, ... ,m. 

55^ ax 13 


(3.6.22) 
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Here , V M is the Levi-Civita connection of M, and because / is an isometric 
immersion, for all X and Y £ T X M, 

Vfr = Vj^ } /*F = (V£*/*F) t by Theorem 3.6.1. (3.6.23) 

(This fact may also be expressed by saying that V M is the connection in the 
pull back bundle induced by the Levi-Civita connection of N ). 

e := f 

a * dx a dx a df l 

where (/ 1 ,...,/ n ) now are local coordinates for N near f(x). Thus, for a 
function ip : N -> R, (e a (ip))(f(x)) = g^ip o f(x). 

Then, computing at x, 

(V^ea) 1 - = V*e a by (3.6.22), (3.6.23) 

= V N. 

#£ 57T dx ° df j 

Qlff Q Qfi Qfk Q 

~ (. dx a ) 2 dp + d^d^ lik dp ' 

Here, Tl are the Christoffel symbols of IV. 

We conclude that f(M) has vanishing mean curvature, i.e. (3.6.21) holds 
if and only if 

(^ + 4(/W)^:|^=0 fa) = (3.6.24) 

(3.6.24) requires that the coordinates are normal at x. In arbitrary coordi- 
nates, (3.6.24) is transformed into 

-A M f j +l a0 {x)r 3 ik {f{x))^y^ =0 for j = 1, . . . , n (3.6.25) 

where Z\m is the Laplace-Beltrami operator of M (see 2.1) and ('y a p) a ,p=i,...,m 
is the metric tensor of M. 

In §8.1, solutions of (3.6.25) will be called harmonic maps. Thus, an iso- 
metric immersion is minimal if and only if it is harmonic. 

A consequence of (3.6.25) is 

Corollary 3.6.3 The one dimensional immersed minimal submanifolds of 
N are the geodesics in N. □ 

We now consider the case where N is Euclidean space R". In Euclidean 
coordinates, all Christoffel symbols Tf k vanish, and we obtain 


Corollary 3.6.4 An immersed submanifold o/R" is minimal if and only if 
all coordinate functions are harmonic (w.r.t. the Laplace-Beltrami operator 
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of the submanifold induced by the ambient Euclidean metric). In particular, 
there are no nontrivial compact minimal submanifolds of Euclidean space. 

Proof. The first claim follows from (3.6.25). The second one follows from the 
fact that, on a compact manifold, every harmonic function is constant by 
Corollary 2.1.2. And a manifold whose coordinate functions are all constant 
is a point, hence trivial. □ 

There is, however, a multitude of noncompact, but complete minimal 
surfaces in R 3 . Besides the trivial example of a plane, we mention: 

1) The catenoid, given by the coordinate representation 

f(s, t) = (cosh s cos t , cosh s sin t, s) . 

2) The helicoid, given by the coordinate representation 

f(s,t) = (t cos s, fsin s, s). 


3) Enneper’s surface, given by the coordinate representation 

s s 3 st 2 t t 3 s 2 t s 2 t 2 

— (2 6~ 2~’ ~2 + 6~ 2~’ ^ 


We leave it as an exercise to the reader to verify that these have vanishing 
mean curvature and hence are minimal surfaces indeed. 

In order to obtain a further slight generalization of the concept of a mini- 
mal surface in a Riemannian manifold, we observe that (3.6.25) is not affected 
if the operator occuring in that formula is multiplied by some (non- vanishing) 
function. In order to elaborate on that observation, we assume that A is a 
two dimensional Riemannian manifold and that coordinates x 1 , x 2 are chosen 
on E for which gp- and gp are always orthogonal and of the same length 
w.r.t. the metric (-,-) 7 of E, i.e. 

d d d d d d 

= n 

dx 1 ’ dx 1 7 dx 2 ' dx 2 7 ’ dx 1 ' dx 2 7 
This is equivalent to the metric 7 being represented by 

\ 2 {x)(dx 1 <8 dx 1 + dx 2 (8 dx 2 ) 

with some positive function A 2 (x) (x = (x 1 ,^ 2 )). Moreover, the precise value 
of A 2 (a;) is irrelevant for (3.6.26). 

In those coordinates, (3.6.25) becomes, for an isometric immersion / : 
E->N, 

1 

W(z) 


d 2 f d 2 p 


{dx 1 ) 2 {dx 2 )‘- 


+ W*»<ggr 


dp df k 
dx 2 dx 2 


)) = 0 , 


(3.6.26) 

(3.6.27) 
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and since as observed the factor vst-te is irrelevant, this becomes 

d 2 P , 9 2 r , ni , 9f j df k ^_ n /ocoo , 

(dx 1 ) 2 + ( dx 2 ) 2 +r ^(f( x ^( dx i ax 1 + a^ 2 dx 2 ^ 0) (3 ' 6 ' 28) 

Since f is required to be an isometric immersion, (3.6.26) becomes 

where now the metric is the one of N. 

In order to provide a conceptual context for a reformulation of the pre- 
ceding insights, we state 


Definition 3.6.6 A surface £ with a conformal structure is a two dimen- 
sional differentiable manifold with an atlas of so-called conformal coordinates 
whose transition functions z = p(x) satisfy 

dz 1 <8> dz 1 + dz 2 <8> dz 2 = p 2 (x)(dx 1 ® dx 1 + dx 2 ® dx 2 ) (3.6.30) 


(z = (z x ,z 2 )^x = (a: 1 ,® 2 )), for some positive function p 2 (x). A map / : £ — » 
N from a surface £ with a conformal structure into a Riemannian manifold 
N is called conformal if in conformal coordinates always 


( df_ 9/, 

dx 1 ' dx 1 dx 2 ' dx 2 


and 


df df 


= 0 . 


'9a; 1 ’ dx 2 1 

In order to interpret (3.6.30), we compute 

dz 1 <g> dz 1 + dz 2 <g> dz 2 = (p\ip\j + p 2 ip 2 ,)dx l <g> dx* . 


(3.6.31) 


(3.6.30) then implies 

dtp 1 dp 1 dp 2 dp 2 _ dp 1 dp 1 dp 2 dp 2 
dx 1 dx 1 9a; 1 9a; 1 dx 2 dx 2 dx 2 dx 2 

and 

VV dp 2 dp 2 
dx 1 dx 2 9a; 1 dx 2 

Thus, the coordinate transformations are conformal in the Euclidean sense. 
A special case of a surface with a conformal structure is a Riemann surface 
as defined in Definition 8.2.1 below. 

We also observe that (3.6.31) is independent of a particular choice of 
conformal coordinates, by a computation analogous to the one just performed. 


Definition 3.6.7 Let A be a surface with conformal structure, N a Rieman- 
nian manifold. 

A (parametric) minimal surface in A 1 is a nonconstant map f : £ —> N 
satisfying (3.6.28) and (3.6.29). 



Exercises for Chapter 3 169 


This definition includes the previous definition of a minimal surface, i.e. 
a twodimensional minimal submanifold of N. Namely, the pull back ( f*g ) a /3 
of the metric tensor gij of N is given by 


lapx) = gij(fx)) 


dp dp 
dx a dx@ 


,df_ df_ 
[ dx a ’ dxP 1 ' 


and if / is conformal, i.e. satisfies (3.6.29), then 


n /acf3(,X > ) — A (x)^a/3 


for some function A 2 (a;). 

If A 2 (:r) p 0, this is the situation previously discussed, and the vanish- 
ing of the mean curvature of f(E) was shown to be equivalent to (3.6.28). 
A 2 (a;) p 0 means that the derivative of / has maximal rank at x, and thus is 
a local immersion. Therefore, the only generalization of our previous concept 
admitted by Definition 3.6.7 is that we now include the degenerate case where 


df_ dp 
dx 1 ’ dx 1 


>= 0 =< 


dp dp 

dx 2 ’ dx 2 


(3.6.32) 


at some (but not all) points of S. 

It may actually be shown that this can happen at most at a discrete set 
of points. 


Perspectives. The theorema egregium of Gauss was the starting point of mod- 
ern differential geometry. It provided the first instance of a nontrivial intrinsic 
differential invariant of a metric, and it motivated Riemann’s definition of sectional 
curvature. For more details, we refer to [71] . In that textbook, also parametric min- 
imal surfaces in ]R 3 are treated. For a comprehensive treatment of minimal surfaces, 
we refer to the monographs [55], [195]. A good reference for minimal submanifolds 
of arbitrary dimension and codimension is [261]. 

Some further discussions about minimal surfaces may be found in chapter 8. 


Exercises for Chapter 3 

1) Compute the transformation behaviour of the Christoffel symbols of 
a connection under coordinate transformations. 

2) Let E be a vector bundle with fiber C” and a Hermitian bundle metric. 
Develop a theory of unitary connections, i.e. of connections respecting 
the bundle metric. 

3) Show that each vector bundle with a bundle metric admits a metric 
connection. 

4) Let Xo £ M, D a flat metric connection on a vector bundle E over M. 
Show that D induces a map 7Ti(M, xo) — »■ 0(n), considering O(n) as 
the isometry group of the fiber E Xo . 
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5) Let 5" := {x € R” +1 : \x\ = r} be the sphere of radius r. Compute 
its curvature tensor and volume. 

6) Consider the hyperboloid in R 3 defined by the equation 

x 2 + y 2 - z 2 = — 1, z > 0. 

Compute its curvature and volume. 

7) Verify that the catenoid, the helicoid, and Enneper’s surface are mi- 
nimal surfaces. 

8) Determine all surfaces of revolution in R 3 that are minimal. (Answer: 
The catenoid is the only one.) 

9) Let F : M m — » R m+1 be an isometric immersion (m = dim M). Give 
a complete derivation of the formula 

AF = m?y 

where A is the Laplace-Beltrami operator of M and ?y is the mean 
curvature vector of F(M). 

10) Let F : M m — » S n C R" +1 be an isometric immersion. Show that 
F(M) is minimal in S n if and only if there exists a function ip on M 
with AF = pF and that in this case necessarily p = m. 

11) Show that for n > 4, there exists no hypersurface (i.e. a submanifold 
of codimension 1) in R™ with negative sectional curvature. 

12) Verify the formula T> = cl o V given in 3.4. 
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4.1 1st and 2nd Variation of Arc Length and Energy 

We start with a preliminary technical remark: 

Let M be a d-dimensional Riemannian manifold with Levi-Civita connec- 
tion V. Let H be a differentiable manifold, and let f : H —> M be smooth. 
In the sequel, H will be an interval I or a square / x / in ]R 2 . Since / is not 
necessarily injective, it is not always possible to speak in an unambiguous 
way about the tangent space to f(H) at a point p £ f(H), even, if / is an 
immersion. Let for example p = f(x) = f{y) with x ^ y. If / is an immersion, 
we may restrict / to sufficiently small neighbourhoods U and V of x and y 
such that f(U) and f(V) have well defined tangent spaces at p. Thus, in a 
double point of f(H), the tangent space can be specified by specifying the 
preimage (a: or y) . This can be formalized as follows: We consider the bundle 
f*(TM) over H, pulled back by /. The fiber over x £ H here is Tf^M. This 
process already has been treated in a more general context in Definition 1.5.5. 
We now introduce a connection /*(V) on f*{TM) by putting for X £ T X H , 
Y a section of f*(TM), 


(rV) x Y:=V df{x) Y (4.1.1) 

(here, f*(TM) x is identified with Tf^M). 

As in 3.4, in order that the right hand side is well defined, Y first has to be 
extended to a neighbourhood of /(7J); as in 3.4, however, it turns out that 
the result will not depend on the choice of extension. In the sequel, instead 
of (/*V), we shall simply write V, since the map / will be clear from the 
context. 

A section of f*(TM ) is called a vector field along f. An important role 
will be played by vector fields along curves c : I — > M, i.e. sections of c*(TM). 

Let now c : [a, b] — > M be a smooth curve, e > 0. A variation of c 
is a differentiable map F : [a, b] x (— e,e) — > M with F(t, 0) = c(t) for 
all t £ [a, 6]. The variation is called proper if the endpoints stay fixed, i.e. 
F(a, s) = c(a),F(b, s) = c(b) for all s £ (— £,e). We also put c s (t) := c(t, s) := 
F(t,s), c(t,s ) := §ic(t,s ) (more precisely, dF(^)c(t, s)), c'{t,s ) = -§~ s c(t,s ) 
(more precisely dF(J^)c(t, s)). 
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As in 1.4, let L( 7 ) and £’( 7 ) denote the length and the energy of a curve 
7 . The following lemma is a reformulation of formulae from 1.4. 

Here, we want to give an intrinsic proof: 

For simplicity, we shall write L(s),E(s) in place of L(c s ), E{c s ) resp. 


Lemma 4.1.1 L(s) and E(s) are differentiable w.r.t. s , and we have 


r ( Ojj t) W,Vac)' 

2/(0) = / dt{ i 

J I (c,c )2 (c,c ) 2 


dt. 


(4.1.2) 


E'{ 0) =(c'(&, 0 ), c( 6 , 0 )) - (c'(a, 0 ), c(a, 0 )) 
f dc dc 

wt {t ’ a))du 


(4.1.3) 


Proof. 


. 1 /',3c. . dc. , 

e(s) = 2 J { dt {t ' s) 'm {t ’ s))dt 

a 

a 

b 

f dc dc 

= I CV -p-(t,s))dt, since V preserves the metric 

J ot at 

a 

(to be precise, we should write £*(V)_a or V rf cv_a 

ds '9s' 

instead of V _a_ ) 

ds 

b 

r Qq Qc 

= / (V_g_ — (t, s), — (t, s))dt, since Vis torsionfree 
J 8t (jg 

a 

= J (s ( S (t ’ s) ' l (t - 5)) - <!’ v * l (t - s)> ) dt 

a 

dc t _ b f.dc dc 
’ dt^ t=a ~ j ^Ws ,v &dt^' s ^ dt ' 


t dc dc 
v ds 
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and similarly 


0 

/ , dc , . dc , . . i , 


sW 


<if(M), ff(M)> = 


I(c',c) (c',Vac) 


In the special case where c = Co is parametrized proportionally to arc- 
length, i.e. ||c(t, 0)|| = const., (4.1.2) becomes 

b 

L\ 0) = -^t((c',c)|^; 5 = 0 o - [ <c', V a c)dt). (4.1.4) 

(C, C) 2 J 

a 

Lemma 4.1.1 implies that c is stationary for E (w.r.t. variations that keep 
the endpoints fixed) and if parametrized proportionally to arc length, also 
stationary for L if and only if 


V e_ c(t, 0) = 0. 


(4.1.5) 


We recall that V a_ stands for now dF(-j^) = s) = c, and 

(4.1.5), as to be expected is the equation for c being geodesic. 

For the case where c = Co is geodesic, we now want to compute the second 
derivatives of E and L at s = 0 : 

Theorem 4.1.1 Let c: [a, 5] — > M be geodesic. Then 


O 

E"( 0) = [ (V»c'(t,0),Vic'(t,0M I 

J dt dt 

a 

b 

- J (R(c, c')c', c)df| s=0 + (V^c / ,c)|*I^ s s =o 

a 

and with c ;± := d — ( tt§tt ^ c/ ) tt§tt (the component of c' orthogonal to c) 


(4.1.6) 


i,,(0) = Mi/ ((v ^ c,± ’ v - c,±) 


-(i?(c,c^)c^,c))dt+(Vac',c)|^} . (4.1.7) 

So J s=0 
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An important point is that for a geodesic c, the second variation depends 
only on the first derivative J^c(f, s)i s= o of the variation, but not on higher 
derivatives. This fact will allow the definition of the index form I below. 

Proof. According to the formulae of the proof of lemma 4.1.1 

b 


d 2 
ds 2 


E(s) = /I (v * 

a 

b 

a 

b 

+ /< V A V * !<*■»■ §<*•»»«“ 

a 

again, since V is metric and torsionfree 
b 

= /< v *K <M) ' V A £<*■»»<“ 

a 

b 

+ /< v SAa|A »),§(*■»)><« 


- 


9c 9c \ dc dc 


dt’dsj ds ’ 9t 


)dt by def. of R. 


Since c is geodesic, we have V_§_||(f,0) = 0, and conclude 


dP_ 

ds 2 


O 

£ (0) = / < v * v *gf(c »»* 


/ / n / "9c 9c \ 9c 9c, , 

" I di’ds) ’9^9t )di|s=0 


9c dc .. t - bs - o 
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Similarly 




dt 


|«=o 


9c 


9c , 


= ind /(Va-(t,0),Va-(i,0))dt 


9t 9 s 


9c 9c \ 9c 9c 
. 9f 7 9s / 9s 7 dt 

I /V7 ^ C \ it=6,s=0 1 

+(V *9s’9i )lt =“’ s =°/ 


)dt | 5 — o 


9s 7 dt 
1 9c 9c 


st 9s 


9 i' 


dt 


= TTIn 1 (VA(</ - 


Also 


9c 9c\ 9 c 9 c 
9t 7 9^J 9? dt' dt 


9s 9s 9t J |«=o 


9s 7 dt 1 

(R(c,c')c',c) 

= ( J R(c,c')-<^,c')^)(c'-(^,c')^),c), 

ll c ll INI ll c ll IMI 

so that for the second variation of L through a proper variation, only the 
component of the variation vector field ^ orthogonal to c appears. □ 

In the same manner, we may consider closed geodesics c : S' 1 — > M. The 
formulae for the second variations of E and L then of course do not contain 
any boundary terms anymore. Otherwise, they remain the same. 


We can already draw some consequences: 

If the sectional curvature of M is nonpositive, the curvature term in the 
second variation formula is always nonnegative, because of the negative sign 
in front of it. The first term only vanishes for parallel variations and is positive 
otherwise. If we consider a proper variation that is nontrivial, i.e. c! ^ 0, we 
get j^E(Q) > 0, hence E(c s ) > E(cq) for sufficiently small |s|. We conclude 
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Corollary 4.1.2 On a manifold with nonpositive sectional curvature, geo- 
desics with fixed endpoints are always locally minimizing. 

(Here, ’’locally minimizing” means that there exists some 5 > 0 such that 
for any (smooth) curve 7 : [a,b\ —> M with 7(a) = c(a),j(b) = c(b ) and 
d('j(t),c(t)) < S for all t G [a, b], we have E( 7) > E(c).) 

Proof Let c : [a, fe] —> M be a smooth geodesic, and let 7 : [a, b] — > M be 
another curve with 7 (a) = c(a),j(b ) = c(b), and such that for no t G [0,6], 
the distance between 7 (t) and c(t) exceeds the injectivity radius of c(t). We 
may then find a smooth geodesic interpolation between c and 7, namely 
the family c(t,s) := exp c ( t ) sexpW 7(f), i.e. a family that satisfies c(f, 0) = 
c(t),c(t, 1) = 7 (t) for all t G [a, 6], and for which all the curves c(t, s) for fixed 
t and s varying in [0,1] are geodesic. Thus, yjf gf(i, s) = 0 for all t and s, 
and from the proof of Theorem 4.1.1 ^l?(s) > 0 for all s G [0,1], not only 
for s = 0. Since j^E(s) | s=0 = 0 as c is geodesic, we conclude E( 7) > E(c). 
(Since we may assume that 7 is parametrized proportionally to arclength, we 
also get L(y) > L(c).) □ 

Although it is a general fact that sufficiently short geodesics are minimiz- 
ing (cf. 1.4), on a positively curved manifold, longer geodesics need not be 
minimizing anymore, as is already seen on S 2 . 

Similarly 

Corollary 4.1.3 On a manifold with negative sectional curvature, closed 
geodesics are strict local minima of E (and L) (except for reparametrizations) . 

Proof. For each variation normal to c the curvature term is positive, because 
of the negative sign in front of it. □ 

On a manifold with vanishing curvature, geodesics are still minimizing, 
but not necessarily strictly so anymore, as the example of a flat torus or 
cylinder shows. On a manifold with positive curvature, closed geodesics in 
general do not minimize anymore, see S 2 again. We want to derive a global 
consequence of this fact. 

Theorem 4.1.2 (Synge). Any compact oriented even- dimensional Rieman- 
nian manifold with positive sectional curvature is simply connected. 

Proof. Otherwise, there exists a nontrivial element of (let Xo G M 

be the base point). Let this element be represented by a closed curve 7 : 
S' 1 — > M. 7 cannot be homotopic to a constant curve even if we do not keep 
the base point fixed. On the other hand, by Theorem 1.4.6, 7 is homotopic 
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to a closed geodesic c of shortest length (and smallest energy) in this free 
homotopy class. Thus, c : S 1 — > M cannot be a constant curve. 

Parallel transport P along c from c(0) to c( 27 t) = c(0) is orientation 
preserving and leaves the orthogonal complement E of c(0) invariant. Since 
E has odd dimension (since M has an even one), there exists a vector v £ E 
with Pv = v. 

Let now X be the parallel vector field along c with X(0) = v. We consider 
a variation c : S' 1 x (—e,e) : ( t , s) e- > c(t , s) of c with c'(t, 0) = X(t) for all t. 

Since c is geodesic, E'( 0) = 0. Since X is parallel and X(0) = X(2tt), 

2t r 

E"( 0) = f(V^X(t),X a X(t))dt 

J dt dt 

0 

2tt 

= - J { R{c,X)X , c)dt 
0 

< 0 . 


2tt 

J (R(c,X)X, c)dt 
o 


Hence 

E(c s ) < E(c) for sufficiently small s, 

and c cannot have least energy in its homotopy class. 

This contradiction proves the claim. □ 


Remark. The previous reasoning would have applied to L instead of E as 
well. 


Let now X be a vector field along c, i.e. a section of c*(TM); in the sequel, 
c will always be geodesic. There exists a variation c : [a, b\ x (— e, e) —> M of 
c(t) with §f |s=0 = X. 

We put 


b 

I(X,X) := J ((VaI,VaI)-(J?(c,I)I,c)) dt, 

a 

i.e. 

d 2 

I(X, X) = —E{ 0), if X{a) = 0 = X(b). 

Instead of a 1-parameter variation c(t, s), we may also consider a 2-parameter 
variation and put ( Y := |^) 

b 

I(X, Y) := [ ((V e X,V eY) - {R(c,X)Y,c))dt. 

I dt dt 


(4.1.8) 
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I(X,Y) is bilinear and symmetric in X and Y (by (3.3.10)). 

Definition 4.1.1 / is called the index form of the geodesic c. 

For a vector field X along c that is only piecewise differentiable, we define 
I(X , X) as the sum of the respective expressions on those subintervals where 
X is differentiable. Each piecewise smooth vector field X along c may be 
approximated by smooth vector fields X n in such a manner that I(X n ,X n ) 
converges to I(X, X). For technical purposes, it is useful, however, to consider 
piecewise smooth vector fields. A variation that is piecewise C 2 gives rise to 
a piecewise C l vector field, and vice versa. 


4.2 Jacobi Fields 

Definition 4.2.1 Let c : I —> M be geodesic. A vector field X along c is 
called a Jacobi field if 

ViViI + Jl(I,c)c=0. (4.2.1) 

dt dt 

As an abbreviation, we shall sometimes write 

dt dt dt 

(4.2.1) then becomes 

X + R(X, c)c = 0 . (4.2.2) 

Lemma 4.2.1 A vector field X along a geodesic c : [a, 6] — > M is a Jacobi 
field if and only if the index form of c satisfies 

I(X,Y) = 0 

for all vector fields Y along c with Y(a) —Y ( b ) = 0. 

Proof. 

b 

I(X,Y) = f((ViI,ViF)- ( R(X,c)c,Y))dt 

I dt dt 

a 

using the symmetries of the curvature tensor 

b 

= f ({-X<lX7j_X,Y) ~ (R{X,c)c,Y))dt, 

I dt dt 

a 

since V is metric and Y(a) = 0 = Y(b), 
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and this vanishes for all Y if 

V _d_ V _d_ X + R(X, c)c = 0 

holds (by the fundamental lemma of the calculus of variations) . □ 

Lemma 4.2.2 A vector field X along the geodesic c : [a, b] — > M is a Jacobi 
field if and only if it is a critical point of I(X,X) w.r.t. all variations with 
fixed endpoints, i.e. 

^I(X + sY,X + sY) , s=0 = 0 
for all vector fields Y along c with Y(a) = 0 = Y(b). 

Proof. We compute 

b 

^I(X + sY,X + sY) | S = 0 = 2 f {-(V o_V jlX,Y) - (R(X,c)c,Y))dt 
ds J 9t Bt 

a 

by the proof of Lemma 4.2.1. □ 


The Jacobi equation thus is the Euler-Lagrange equation for 
I(X):=I(X,X). 

More generally, one can consider the second variation for each critical 
point of a variational problem. The second variation then is a quadratic in- 
tegral in the variation vector fields, and the second variation may hence be 
considered as a new variational problem. This new variational problem is 
called accessory variational problem of the original one. Most of the consid- 
erations of this Paragraph may be generalized to such accessory variational 
problems. 

We now want to prove existence and uniqueness of Jacobi fields with given 
initial values. For this purpose, we shall simply interpret the Jacobi equation 
as a system of d(= dim M) linear second order ODEs. 

Lemma 4.2.3 Let c : [ a,b } — > M be geodesic. For any v,w £ T C ( 0 )M, there 
exists a unique Jacobi field X along c with 

X(a) = v,X(a) = w. 


Proof. Let v\ , ... ,Vd be an orthonormal basis of T C ^M. Let Xi, . . . , Xd be 
parallel vector fields along c with Xjf a) = v-i, i = 1, ... ,d. Then, for each 
t € [a,b], X\ (<),... ,Xd(t) is an orthonormal base of T C ^M. An arbitrary 
vector held X along c is written as 


X = ?Xi (£*(*) = <*(*), *(*)». 
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Since the vector fields X, are parallel, we have 


dC d 2 C 

Vil = -fli.ViViI = -7%-Xi. 
dt dt dt dt dt 2 

We likewise write the curvature term in (4.2.1) as a linear combination of the 

R{X i ,c)c=p$X ki 

and then also 

R(X,c)c = CrfX k . 

The Jacobi equation (4.2.1) now becomes 

'd 2 £, k 


+ Cp k )X k = 0, 


dt 2 


i.e. a system of d linear 2nd order ODE 
d 2 t; k {t) 


dt 2 


+ C(t)Pi(t) = 0, k = l,...,d, 


and for such systems, the desired existence and uniqueness result is valid. □ 


It is easy to describe those Jacobi fields that are tangential to c. 

Lemma 4.2.4 Let c : [a, b] — > M be geodesic, A,/j£K. Then the Jacobi field 
X along c with X(a) = Ac(a),X(a) = pc(a) is given by 

X(t) = (A + (t — a)p)c(t). 


Proof. Directly from (4.2.1), since R(c,c) = 0 because of the skew symmetry 
of R. □ 

Thus, tangential Jacobi fields do not depend at all on the geometry of 
M, and hence, they cannot yield any information about the geometry of M. 
Consequently, they are without any interest for us. We shall see in the sequel, 
however, that normal Jacobi fields are extremely useful tools for studying the 
geometry of Riemannian manifolds. 

Examples. 

1) In Euclidean space R", geodesics are straight lines. Jacobi fields are 
linear: Namely, the Jacobi field X along a straight line c with c mit 
X(a) = v,X(a) = w is given by 

X(t) = V(t) + (t- a)W(t), (4.2.3) 

where V ( t ) and W (t) are parallel fields along c with V (a) = v, W (a) = 
w. 
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2) S n C R ra+1 . Let c : [0,T] — > S n be geodesic with ||c|| = 1, v,w € 
T c ( o)5", V, W parallel vector fields along c with V(0) = v,W( 0) = w. 
Assume (u,c(0)) = 0 = (w,c( 0)). We claim that the Jacobi field X 
with A(0) = v. X(0) = w along c is given by 

X(t) = V (t) cos t + W(t) sin t. (4.2.4) 

Namely, since V and W are parallel, 

X(t) = —V(t) sini + W{t) cos t 

X(t) = —V(t) cos t — W(t ) sint. 

By (3.6.12), 

R{X, c)c= (c, c) X — (X, c)c= X, since (c, c) = 1 
and since v and w, hence also V and W are orthogonal to c. 

Hence, 

X + R(X, c)c = 0, 
and X indeed is a Jacobi field. 

Arbitrary initial values that are not necessarily orthogonal to c may be 
split into a tangential and a normal part. The desired Jacobi field then is the 
sum of the corresponding tangential and normal ones, because as (4.2.1) is 
linear the sum of two solutions of (4.2.1) is a solution again. 

If more generally ||c|| = /u, the Jacobi field with initial values v,w normal 
to c is given by 

X(t) = V(t) cos(nt) + W(t) sin(iii) . (4.2.5) 

If we consider more generally the sphere 

S; := {x € R" +1 : |z| = p} 
of radius p, then the curvature is given by 

R{X, Y)Z = \((Y, Z)X - (A, Z)Y) 

P 

and the Jacobi field with initial values v,w normal to c with ||c|| = 1 

X(t) = V(t) cos - + pW(t) sin (4.2.6) 

P P 

Theorem 4.2.1 Let c : [0,T] — > M be geodesic. Let c(t,s) be a variation of 
c(t) (c( v ) : [0,T] x (—e,£) — > M), for which all curves c(-,s) =: c s (-) are 
geodesics, too. Then, 

A [t) . c(t, s)| s=0 

is a Jacobi field along c(t) = co(t). Conversely, every Jacobi field along c(t) 
may be obtained in this way, i.e. by a variation of c(t) through geodesics. 
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Proof. 


■■ dc dc 

X(t) = V V _s_ — = VjlVjltc- 

at at OS |s=0 at 9s Ot\s=0 

dc 


= ViViy ~ R (^r^ir]ir bydef. ofi? 
9s ®*9<| s =o \<9s dt J dt\ s =o 


= -R 


dc dc \ dc 
ds' dt) dt | s =o’ 


since all curves c s are geodesic 


= - R \ x ' |) % t>y "■ 


Thus, X indeed is a Jacobi field. 

Conversely, let X be a Jacobi field along c(t). Let 7 be the geodesic 
7 : (— £,e) — > M with 7 ( 0 ) = c( 0 ), 7 / ( 0 ) = X(0). 

Let V and W be parallel vector fields along 7 with 

14(0) = c(0), W(0) = X(0). 


We put 

c(t,s) := exp 7 (a) (t(V(s) + sW(s))). (4.2.7) 

Then all curves c(-,s) = c s (-) are geodesic (by definition of the exponential 
map), and c(t, 0) = exp c( - 0 ^ £c(0) = c(f). Thus, c(t, s) is a variation of c(t) 
through geodesics. By the first part of the proof, 

Y(t) ■= §f c (tis ) |«=o 

then is a Jacobi field along c 0 . Finally, 


no) 

= £( ex P 7 ( S )°)|s =0 



= £700| s =o = n 0 ) 

by def. of 7 

no) 

= ^ -§1 S )|s =0 



= y -§1 5t C (^> s ) | s=0 5 

since V is torsionfree 


= Va(V( S ) + S lL( S ))| s= o 

= W(0) , since V and W are parallel along 7 

= *( 0 ) • 

Thus, Y is a Jacobi field along Co with the same initial values Y(0), Y(0) as 
X. The uniqueness result of Lemma 4.2.3 implies X = Y. We have thus shown 
that X may be obtained from a variation of c(t) through geodesics. □ 

The computation at the beginning of the previous proof reveals the geo- 
metric origin of the Jacobi equation: 
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Let c(t, s) = c s (t ) be a family of geodesics parametrized by s, i.e. 


V. !«,») = 0 


for alls. 


Then also 


Va 

ds 


_ dc 
V a — 
<9f 


(t, s) = 0, 


and this implies that X(t) = §f (t, s)| s =o satisfies the Jacobi equation. Conse- 
quently, the Jacobi equation is the linearization of the equation for geodesic 
curves. This also illuminates the relation between Jacobi fields and the in- 
dex form. If one has in particular a proper variation of a geodesic through 
geodesics, then also the 2nd derivative of the length and energy functionals 
w.r.t. the family parameter vanish. 

As an example, consider the family of great semicircles on S n through 
two fixed antipodal points, e.g. north pole and south pole. Here, the length 
is even constant on the whole family. 

The theory of Jacobi fields can be generalized to other variational prob- 
lems, and actually, this theory was already conceived by Jacobi in general 
form. 


Corollary 4.2.1 Every Killing field X on M is a Jacobi field along any 
geodesic c in M. 

Proof. By Lemma 1.6.7, a Killing field X generates a local 1-parameter group 
of isometries. Isometries map geodesics to geodesics. Thus, X generates a 
variation of c through geodesics. Theorem 4.2.1 then implies the claim. □ 

Corollary 4.2.2 Let c : [0,T] — > M be a geodesic, p = c(0), i.e. 

c(t) = exp p tc(0). 

For w £ T p M , the Jacobi field X along c with X(0) = 0, A(0) = w then is 
given by 

X(t) = (Dexp p )(tc(0))(tw) or, in different notation, exp p (tw) 

(the derivative of the exponential map exp p : T p M — > M, (4.2.8) 

evaluated at the point tc( 0) £ T p M and applied to tw) . 


Proof. c(t, s ) := exp p t(c(0) + sw ) is a variation of c(t) through geodesics, and 
by Theorem 4.2.1, the corresponding Jacobi field is 

x (t) = ^c(M)| s =o = (-D exp p ) (ic(0) ) (tw ) , 
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and 


X(0) = (£ex Pp )(0)(0) = 0 

A(0) = w (as in the proof of Theorem 4.2.1). □ 


Consequently, the derivative of the exponential map can be computed 
from Jacobi fields along radial geodesics. 

Corollary 4.2.2 yields an alternative method for a quick computation of 
the curvature tensor of S n . Let Xo G S n ,z € T Xo S n with ||z|| = 1. The 
geodesic c : R. — > S n with c(0) = j; 0 > c(0) = z then is given by 

c(t) = (cos t)x o + (sin t)z . 

Let w G T Xo S n , |H| = 1, (w, z) = 0 

c(t,s ) = (cosf)a;o + (sinf)((cos s)z + (sins)w) 

then is a variation of c(t) through geodesics. Furthermore, the vector field 
along c(t) defined by W(t) = w is parallel (cf. Theorem 3.4.1). Hence, the 
corresponding Jacobi field is 

X(t) = ^ c (t,s) | s=0 = (sint)JF(t) (cf. (4.2.4)). 

We have 

X(t) + X(t) = 0. 

The Jacobi equation then implies 

X(t) = R(X(t), c)c, 


and in particular 

(R(w, z)z,w) = 1 = (w,w)(z,z) — ( w,z ) 2 . 

Lemma 3.3.3 implies 

(R(u,v)w, z) = ( u,z)(v,w ) — ( u,w){v,z ), i.e. (3.4.11). 

Another consequence is the so called Gauss lemma: 

Corollary 4.2.3 Let p G M,v G T p M,c(t) := exp tv the geodesic with 
c(0) = p, c(0) = v (t G [0, 1]), assuming that v is contained in the domain of 
definition of exp p . Then for any w G T p M 

{ v , w) = (( D v exp p )v, (A. exp p )w), (4.2.9) 

where D v exp p , the derivative of exp at the point v, is applied to the vectors 
v and w considered as vectors tangent to T p M at the point v. 
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Proof. By Corollary 4.2.2, 

X(t) = D tv exp p (tw) (4.2.10) 

is a Jacobi field along c, and 

1(0) = w, 

and hence 

(v,w) = (c(0),l(0)). (4.2.11) 

We split X(t) into a part X tan tangential to c and a part X nor normal to c. 
By Lemma 4.2.4 

X tan {t) = t/jbc(t), with ! tan (0) = Atc(0). (4.2.12) 

Hence 

(v,w)= (c(0), X tan (0)) with (4.2.11) and since 

m,x™(t)} = o 

= (c(l),X tan (l)) with (4.2.12) 

= (c(l),X(l)) since (c(t) , X no1 (t)) = 0 

= {(D v exp p )v, (D v exp p )w) with (4.2.10). □ 


(4.2.9) means that exp p is a radial isometry in the sense that the length 
of the radial component of any vector tangent to T p M is preserved. If a curve 
7(s) in T p M intersects the radius orthogonally, then the curve exp p 7(s) in 
M intersects the geodesic c(t) = exp p tv orthogonally as well. In particular, 
c(t) = exp p tv is orthogonal to the images of all distance spheres in T p M. 
Moreover, we may repeat Corollary 1.4.2: 

Corollary 4.2.4 Let p £ M, and let v £ T p M be contained in the domain 
of definition of ex p p , and let c(t) = exp p t.v. Let the piecewise smooth curve 
7 : [0,1] — » T p M be likewise contained in the domain of definition of ex p p , 
and assume 7(0) = 0,7(1) = v. Then 

HI = £(exp p iV| te[0)1] ) < L(exp p 07), (4.2.13) 

and equality holds if and only if 7 differs from the curve tv,t £ [0, 1] only by 
reparametrization. 

Proof. We shall show that any piecewise smooth curve 7 : [0, 1] — > T p M with 
7(0) = 0 satisfies 

£( ex P P 7) > ||7(l)ll , (4.2.14) 

with equality precisely for those curves whose image under exp p is the radius 
£7(1), 0 < t < 1. This will then imply (4.2.13). 
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We write 

7 (t) = r(t)ip(t) ( r(t ) G K, <p(t) G T P M) 

with ||<^(f)|| = 1 (polar coordinates in T p M ). Applying the subsequent esti- 
mates on any subinterval of [0, 1] on which 7 is differentiable, we may assume 
from the onset that 7 is smooth everywhere. 

We have 


7 (t) = r{t)<p(t) + r(t)ip(t) with (<p(t) , <p(t)) = 0. 

Thus, by Corollary 4.2.2, also 

(D 7{t) exp(f(t),D^ t) expip(t)) = 0, \\D l(t) exp ip(t)\\ = ||y>(t)|| = 1, 

and it follows that 

||(exp p 07) WII = ||(-D 7 W ex Pp)(7W)ll 

> \r{t)\, 


i( ex Pp7 ) = J ||(exp p o 7 )'(t)||dt > J \r(t)\dt > r(l) — r(0) = ||7(1)||, 

0 0 

with equality only, if ip(t) = 0 and r(f) is monotone, i.e. if 7 (t) coincides with 
the radial curve £7(1), 0 < t < 1 up to reparametrization. □ 

We point out that alternatively, one can also prove Corollaries 4.2.3 and 
4.2.4 with the arguments of the proofs of Theorem 1.4.5 and Corollary 1.4.2. 

Corollary 4.2.4 by no means implies that the geodesic c(t) = exp tv is 
the shortest connection between its end points. It only is shorter than any 
other curve that is the exponential image of a curve with the same initial and 
end points as the ray tv, 0 < t < 1. 


4.3 Conjugate Points and Distance Minimizing 
Geodesics 

Definition 4.3.1 Let c : I — > M be geodesic. For to,t\ G I, to ^ t\, c(t 0) 
and c(t\) are called conjugate along c if there exists a Jacobi field X(t) along 
c that does not vanish identically, but satisfies 

X(t 0 ) =0 = X(t 1 ). 


Of course, such a Jacobi field X is always normal to c (Lemma 4.2.4). 
If to,ti G I,to 7^ t±, are not conjugate along c, then for v G T c t to )M,w G 
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T c ( tl ) M, there exists a unique Jacobi field Y(t) along c with Y(t 0 ) = 
v,Y(ti) = w. Namely, let J c be the vector space of Jacobi fields along c 
(dim J c = 2dimM by Lemma 4.2.3). We define a linear map 

A: J c —> T c p 0 )M x T c ( tl )M 

by 

A(Y) = (Y(t 0 ),Y( tl )). 

Since to and t\ are not conjugate along c, the kernel of A is trivial, and A is 
injective, hence bijective as domain and range of A have the same dimension. 

Theorem 4.3.1 Let c : [a, b] — > M be geodesic. 

(i) If there does not exist a point conjugate to c(a) along c, then there 
exists e > 0 with the property that for any piecewise smooth curve 

g : [a, b\ -> M 

with g{a) = c{a),g{b) = c{b), d{g{t) , c{t)) < e for all t. £ [a, b\, we have 

L(g) > L{c) (4.3.1) 

with equality if and only if g is a reparametrization of c. 

(ii) If there does exist r € (a, b) for which c(a ) and c(t) are conjugate 
along c, then there exists a proper variation 

c(t , s) : [a, b] x (— e, e) — > M 

with 

L(c s ) < L(c) forO<\s\<e (c s (t) := c(t, s)). (4.3.2) 


Proof. 

(i) We want to apply Corollary 4.2.4. We therefore have to show that in 
the absence of conjugate points, for each curve as in (i), there exists 
a curve 7 as described in Corollary 4.2.4. W.l.o.g. a = 0,b = 1. We 
put v := c(0). 

By Corollary 4.2.2, since there are no conjugate points along c, the 
exponential map exp p is of maximal rank along any radial curve 
tv, 0 < t < 1. Thus, by the inverse function theorem, for each 
such t, exp p is a diffeomorphism in a suitable neighborhood of tv. 
We cover {tv, 0 < t < 1} by finitely many such neighborhoods 
fii, i = 1 , . . . , k; Ui := exp p fl i: . 

Let us assume 

tv € fii for ti - 1 <t <ti {to = 0, tk = 1)- 
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If e > 0 is sufficiently small, we have for any curve g : [0, 1] — » M 
satisfying the assumptions of (i), 

g{[ti-uti]) C Ut. (4.3.3) 

We now claim that for any g satisfying (4.3.3), there exists a curve 7 
in T p M with exp p 7 = g, 7 ( 0 ) = 0 , 7 ( 1 ) = v. 

For this, we simply put 

lit) = (exp p | fi .) -1 (g(f)) for < t < U 

7 then satisfies the assumption of Corollary 4.2.4, and we obtain (i). 

(ii) Again, w.l.o.g. a = 0, b = 1. Let X be a nontrivial Jacobi field along c 
with X(0) = 0 = X(t). We have X(t) / 0, since otherwise X = 0 by 
the uniqueness result of Lemma 4.2.3. Let now Z(t) be an arbitrary 
vector field along c with 

Z(0) = 0=Z(1),Z(t) = -X(t). 


For g > 0, we put 

Yj-(t):= X(t) + T)Z(t) for 0 < t < r 
Yp {t):= r]Z(t) for r < t < 1 


Y v (t) := 


Y}(t ) for 0 < t < t 
Yp ( t ) for t < t < 1 


With Z 1 := Z l[0 ' T] ,Z 2 := Z\y T ^ we have 


J i Y V’ Y v) 

I(yl Y n) 


{X{T),2gZ{r)) + rfl{Z\Z l ) 
-2g\\ X{r)f + r?I{Z\Z 1 ) 
g 2 I(Z 2 ,Z 2 ). 


Hence 


I(Y V , Y„) = I(Y V \Y;) + I(Y 2 , Y 2 ) = -2 V \\X(t)\\ 2 + g 2 I{Z, Z) 

for sufficiently small g > 0. The variation c(t, s) := exp c ( t ) sY v (t ) then 
satisfies (with L(s) := L(c s )) 

L'(0) = 0, L"(0) = I{Y V , Y v ) < 0, 

and the claim follows from Taylor’s theorem. □ 


Theorem 4.3. l(i) implies only that in the absence of conjugate points, a 
geodesic is length minimizing when compared with sufficiently close curves. 
As is seen by considering geodesics on a flat cylinder or torus that wind 
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around more than once, even when there are no conjugate points, a geodesic 
need not be the global shortest connection between its end points. 

On the sphere S n , on any geodesic the first point conjugate to the initial 
point is reached precisely after travelling a semi circle (see (4.2.4)). By Theo- 
rem 4.3.1 consequently each geodesic arc shorter than a great semi circle, i.e. 
shorter than n, is locally length minimizing, whereas any geodesic arc on S n 
longer than n is not even locally the shortest connection of its end points. 
For a curve c : [a, b\ — > M let V c be the space of vector fields along c, i.e. 

V c = r(c*TM), 


and let V c be the space of vector fields along c satisfying V(a) = V(b) = 0. 

Lemma 4.3.1 Let c : [ a,b } — > M be geodesic. Then there is no pair of 
conjugate points along c if and only if the index form I of c is positive definite 

O 

on V c . 

Proof. Assume that c has no conjugate points. Theorem 4.3. 1 (i) implies 

I(X, X)>0 for all X € V c , (4.3.4) 

because otherwise c(t , s) := exp c ( t ) sX(t) would be a locally length decreasing 

O 

deformation. If I(Y, Y) = 0 for some Y G V c , then by (4.3.4) for all 
Z G V c , A G R. 0 < I(Y - XZ, Y - XZ) = -2X1 (Y, Z) + A 2 I(Z, Z) , and hence 

O 

I(Y,Z) = 0 for all Z G V c . Lemma 4.2.1 then implies that Y is a Jacobi 
field. Since there are no conjugate points along c, we get Y = 0. Hence, / is 
positive definite. 

Now assume that for t 0 ,ti G [a,b] (w.l.o.g. to < ti), c(to) and c(ti) are 
conjugate along c. Then there exists a nontrivial Jacobi field X along c with 
X(t 0 ) = 0 = Xih). We put 

(0 for a < t < to 
Y(t) := < X(t) for to<t<t\ . 

I 0 for t\ < t < b 

Then I(Y,Y) = 0, and / is not positive definite. □ 


We now introduce the following norm on V c : 

b 

11*11 := ( / «*, *) + <*, X))dt) 5 • (4.3.5) 

a 

o o 


Let H l be the completion of V c w.r.t. || • ||. 
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Introducing an orthonormal basis {V t } of parallel vector fields (i = 
1 , ,d = dim M) and writing 

X = ?Vi, 

we have X = ffVi, and 

b 

11*11 = (J(ce+ee)dt) h . 

a 

o o 

Hence, H \ can be identified with the Sobolev space H 1,2 (/, R d ). We now 

O 

consider the index form of c as a quadratic form on H }. : 

I:H\ xHl-tR, 


b 


I(X, Y) = j ((X, Y) - (R(c, X)Y, c))dt . 

a 

(4.3.6) 

Definition 4.3.2 The index of c, Ind(c), is the dimension of the largest 

O 

subspace of H\, on which / is negative definite, and the extended index of 

c, Ind 0 (c), is the dimension of the largest subspace of 
negative semidefinite. Finally, the nullity of c is 

on which / is 

N(c) := Indo(c) — Ind(c). 


Lemma 4.3.2 Ind(c) and N(c ) are finite. 


Proof. Otherwise, there exists a sequence (X„)„ g j*j with 


I(X n ,X n ) <0 

(4.3.7) 

and 

b 


[ (*n> Xjjfjdt — S nm 

(4.3.8) 


for all n,m £ N. ((4.3.8) means that ( X n ) is an orthonormal sequence w.r.t. 
the L 2 -product.) 

(4.3.7) and (4.3.8) imply 

b 

j ( X n , X n ) < sup \R\ E(c) (where R is the curvature tensor of M). (4.3.9) 

a 
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By (4.3.8) and (4.3.9) 

IPU < const. (4.3.10) 

By Rellich’s Theorem (Corollary A. 1.3), a subsequence converges in L 2 . This, 
however, is not compatible with (4.3.8), since an orthonormal sequence cannot 
be a Cauchy sequence. □ 

For t € (a, b) let ./* be the space of Jacobi fields X along c with X(a) = 
0 = X(t). 

Lemma 4.3.3 N(c ) = dim Jf. 

Proof. From Lemma 4.2.1. □ 

We now want to derive the Morse Index Theorem. 

Theorem 4.3.2 Let c : [a, b] — > M be geodesic. Then there are at most 
finitely many points conjugate to c(a ) along c, and 

Ind(c) = ^ dimJ*, (4.3.11) 

t(z(a,b ) 

Indo(c) = dimJ* . (4.3.12) 

££(a,6] 


Proof. For each ti € (a, 6], for which c(t,) is conjugate to c(o), there exists a 
Jacobi field X; along c with A) (a) = 0 = X,(i,). We put 



for a < t < ti 
otherwise 


The Yi are linearly independent, and /(F);, Y. t ) = 0 for all i. Therefore, the 
number of conjugate points is at most Indo(c), hence finite by Lemma 4.3.2. 
For r £ (a, b ], we put 


V?(r) := Ind(c| [a>T] ), ip 0 (r) = Ind 0 (c| [O;T] ). 


(i) tp(r) is left continuous. 

(ii) <Po(t) is right continuous. 

Proof of (i). For r £ (a, b] let I T be the index form of C|[ a r ] . Let the vector 
field X along C| [«., T ] satisfy I T (X,X) < 0,||A|| = 1. We consider the vector 
field X defined by X{t) := X{^t) on [a, a\. 
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Then 


hence 


[ (X(t),X(t))dt = [ C-) 2 (X{-t),xC-t))dt 

J J cr a a 

o o 

T 

= C a ) J(X(s),X(s))ds, 

0 


J(X(t),X(t))dt^ J {X(t),X(t))dt 


for a 


T. 


0 


0 


Moreover, because of ||X|| = 1, X is continuous by the Sobolev embedding 
theorem (Theorem A. 1.7). Hence, X also converges pointwise to X as a — » r, 
hence also 


<7 



(. R(c,X)X,c)dt 


J (R(c,X)X, c)dt 


for a 


T. 


0 


0 


We conclude 

I a (X,X) -► I T (X,X) for cr — » r. 

In particular, 

I a (X,X) < 0, if cr is sufficiently close tor. 


For each orthonormal basis of a space on which I T is negative definite, we 
may thus find a basis of some space on which I a is negative definite, provided 
a is sufficiently close to r. 

Since ip is monotonically increasing, this implies the left continuity of ip. 

Proof of (ii). Let (r n ) raS N C (a, b] converge to r € (a, b ]. For each n € N, let 
X n be a vector field along C|[ a>r i with ||X n || = 1 and I Tn (X n ,X n ) < 0. After 
selecting a subsequence, X n converges weakly in the Sobolev H 1,2 topology 
to some vector held X along C| [«, T ] (cf. Theorem A. 1.9). Then 

T T n 

/ ( X,X)dt < liminf [ (X n ,X n )dt. 

J n *oo J 

0 0 

Furthermore, by Rellich’s theorem (Corollary A. 1.3), X n also converges 
(strongly) in L 2 , hence 


T T n 

[ (R{c,X)X,c)dt= lim f (R(c,X)X,c)dt. 

J n ^°° J 


o 


0 
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We conclude 


I T (X,X) < lim inf I T (X n , X n ) < 0. 

We also need to check that X does not vanish identically. Since I(X n , X n ) < 
0, we have 

T n T n 

j ^ X n ,X n t> dt ' k j X n ,X n i> dt, 

0 0 

where the constant k depends on the norm of c and the curvature tensor R. 
Since the Sobolev norm ||X n || = 1, this implies that the rhs cannot go to 
0 as otherwise so would the lhs, and then also ||X n || would go to 0. Since 
X n converges strongly to X in L 2 , by Relich’s theorem, the L 2 -norm of X 
is positive as well. Moreover, by a similar argument, if we have two such 
sequences (Xf),(X 2 ), with f < Xf,X 2 > dt = 0 for all n, then the same 
holds for the limits X x ,X 2 . 

Since <po is monotonically increasing, this implies the right continuity of 
^o- 

We can now easily conclude the proof of Theorem 4.3.2: 

Let a < t\ < t -2 < ■ ■ ■ < tk < b be the points for which c(tj) is conjugate 
to c(a). Lemma 4.3.3 implies 

<p 0 (t) - ip(t) = 0 fort g (a, b\\{tu...,tk}. (4.3.13) 

Hence 

k 

dim J* = Y (^o(i) - <p(t)) = ^(^°(ti) - <p(U)). 

t£(a,b] i = 1 

Since <p is left continuous and (fig is right continuous, we have 
<Po(ti) = <p(ti+i) (i = 1, ■ ■ ■ , k — 1). 


Hence 

k 

Efoft) - <p(ti)) = <Po(t/c) - <p(h)- 

i=l 

Since Lp is left continuous, Lemma 4.3.1 implies ^(ti) = 0. The continuity 
properties of ip and ip o and (4.3.13) imply that <p and (po can jump only at 
those points r where <Po(t ) ^ p(t), i.e. at the conjugate points. In particu- 
lar, ip o is constant on [tk,b\, hence Po(tk) = < Po(^)- Altogether, we conclude 
<Po(b) = E dim J*, i.e. (4.3.12). (4.3.11) then follows with the help of 

t£(a,b\ 

Lemma 4.3.3. □ 

As an application of the second variation, we now present the Theorem 
of Bonnet-Myers: 
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Corollary 4.3.1 Let M be a Riemannian manifold of dimension n with Ricci 
curvature > A > 0, i.e. 

Ric(X, X) > X(X, X) for allX G TM. 

Let M be complete in the sense that it is closed and any two points can be 
joined by a shortest geodesic (cf. the Hopf-Rinow Theorem l.f.8). Then the 

diameter of M is less or equal to n^j In particular, M is compact. Also, 
M has finite fundamental group tti(M). 

Remark. The diameter is defined as 

diam(M) := sup d(p,q), 

p,q£M 

where d{ ■, •) denotes the distance function of the Riemannian metric. 

The sphere 

S n {r) := {x G R n+1 : |x| = r} 

of radius r has curvature , hence Ricci curvature and diameter nr. We 
choose r such that A = Corollary 4.3.1 then means that if M has Ricci 
curvature not less than the one of S n (r), then the diameter of M is at most 
the one of S n (r). 

Proof. For each p < diam(M), there exist p,q G M with d{p, q) = p and then 
by the completeness assumption a shortest geodesic arc c : [0, p] — > M with 
c(0) = p, c(p) = q. Let ei, . . . , e n be an orthonormal basis of T p M, e\ = c(0). 
As usual, from this, we may construct a parallel orthonormal basis 

{c(t),X 2 (t),...,X n (t)} 

along c. With Yj(t) : (sin y)Xj(t), i = 2, . . . , n we have 

p 

I(Yi,Yi) = J {—{Yi, Yf) - (R(Yi, c)c, Yf))dt 
0 

P r 2 

= J (sin 2 - {R(X i ,c)c,X i )^jdt. 

o 

Since c is the shortest connection of its end points, by Theorem 4.3.1 (ii), 
there is no pair of conjugate points in the interior of c, and Lemma 4.3.1 
implies 

I{Yi,Yi) > 0 for all*, 

hence also 
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n r jrf 

0 < y ^I(Yi,Yi) = I (sin 2 — )( — (n- 1) - Ric(c, c))dt 
i = 2 o'' 

< (—v(n — 1) — A) / sin 2 —dt, 

P- J P 

o 

since the Y t form an orthonormal basis of the subspace of T c ( t ) M normal to 
c. Consequently, p < and since this holds for any p < diam(M), we 

obtain the estimate for the diameter. The universal cover of M satisfies the 
same assumption on the Ricci curvature. Hence, it is compact as well. This 
implies that the group of covering transformations, i.e. ni(M ), is finite. □ 


4.4 Riemannian Manifolds of Constant Curvature 

We have already met Euclidean spaces and spheres as Riemannian manifolds 
of vanishing and constant positive sectional curvature, resp. We now want 
to discuss hyperbolic space as an example of a Riemannian manifold with 
constant negative sectional curvature. 

For this purpose, we equip R" +1 with the quadratic form 

{x, x) := —(a; 0 ) 2 + (a; 1 ) 2 + . . . + ( x n ) 2 (x = (a: 0 , . . . , x n )). 

We define 

H n := {x G R” +1 : (x,x) = -l,x° > 0}. 

Thus, H n is a hyperboloid of revolution; the condition a; 0 > 0 ensures that 
H n is connected. 

The symmetric bilinear form 

I := -(dx 0 ) 2 + (dx 1 ) 2 + . . . + (dx n ) 2 

induces a positive definite symmetric bilinear form on H n . Namely, if p G H n , 
T p H n is orthogonal to p w.r.t. (•, •). Therefore, the restriction of I to T p H n is 
positive definite by Sylvester’s theorem. We thus obtain a Riemannian metric 
(•, •) on H n . The resulting Riemannian manifold is called hyperbolic space. 

Let 0(n, 1) be the group of those linear self maps of R" +1 that leave the 
form (., .) invariant. Those elements of 0(n, 1) that map the positive ar-axis 
onto itself, then also leave H n invariant and operate on H n by isometries. This 
is completely analogous to the isometric operation of 0(n+ 1) on S n C R n+1 . 
As we have seen in 1.4 for S n , we see here that the geodesics of H n are 
precisely the intersections of H n with twodimensional linear subspaces of 
R n+1 . 

If p G H n ,v G T p H n with ||ti|| = 1, the geodesic c : R — > H n with 
c(0) = p, c(0) = v is given by 
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c(t) = (cosh t)p+ (sinh t)v. 


Indeed, 

(c(t),c(t)) = — cosh 2 t + sinh 2 t = — lj 
since (p,p) = — 1, (p, v) = 0, (v, v) = 1 

and ( c(t),c(t )} = — sinh 2 t + cosh 2 1 = 1. As on S", we may now compute the 
curvature with the help of Jacobi fields. 

For this, let w € T p H n , (w,w) = 1, (w, v) = 0. We then obtain a family 
of geodesics 

c(t, s) := (cosh t)p + sinh t(cos sv + sin s w) . 

The corresponding Jacobi field 

d 

X{t) = —c(t, s)| s=0 = (sinh t)w 

then satisfies 

m=x(t). 

The Jacobi equation implies R(X, c)c = — X , and so, the sectional curvature 
is —1. 

We may then also obtain a space H n {p) of constant sectional curvature 
— p by scaling the metric with factor p and considering 

<•, V= PM- 


4.5 The Rauch Comparison Theorems and Other 
Jacobi Field Estimates 

We first compare the three model spaces S n 1 M n ,H n of curvature 1,0,— 1. 
Let c(t) be a geodesic with ||c|| = 1, v € T C ( 0 )M , M € {S'”,!!”,#"} with 
|M| = 1. The Jacobi field J(t) along c with 

J(0) = 0, J(0) = V 

is given by 

(sin t)v,tv, (sinht)u resp. 

According to our geometric interpretation of Jacobi fields as infinitesimal 
families of geodesics (Theorem 4.2.1) this means, that on S", geodesics with 
the same initial point initially diverge, but then converge again, whereas 
such geodesics diverge linearly on R" and even exponentially on H" . 

Let now M be a Riemannian manifold with curvature K satisfying 


A < K < p 
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and suppose initially A < 0,p > 0. We shall estimate a Jacobi field in M 
from above by the Jacobi field in H n {— A) with initial values of same lengths, 
and from below by the corresponding one in S n {p). This implies that the 
distance between geodesics and also the derivative of the exponential map 
of M can be controlled by the geometry of the model spaces H n (— A) and 
S n (p). Since tangential Jacobi fields are always linear (Lemma 4.2.4), hence 
independent of the geometry of M, for our curvature bounds A < K < p, we 
shall need to assume in the sequel A < 0 and p > 0, or else, we shall have to 
restrict attention to Jacobi fields whose tangential component J tan vanishes 
identically. 

For abbreviation, we put for p G M 



f COS (y/pt) 

if p > 0 

Cp{t) 

= \ 1 

O 

II 


{ cosh (s/^pt) 

if p < 0 

and 

[ 77 sin (y/pt) 

if p > 0 

s p {t) := \ 


II 

o 

1 

l 7 ^ sinh (y/=pt) 

if p < 0 

These functions are solutions of the Jacobi equation for constant sectional 

curvature p, namely 

f(t ) + pf(t) = o 

(4.5.1) 

with initial values /( 0) = 

l,/(0) = 0, resp. 

/( o) = 0,/(0) = 1 . c(t) will 


always be a geodesic on M parametrized by arc length, i.e. satisfying 

||c|| = l. (4.5.2) 

Let J(t) be a Jacobi field along c(t). 


Theorem 4.5.1 Suppose K < p, and as always, ||c|| = 1. Assume either 


p > 0 or 

J tan = 0. Let fp := | J(0)\cp + |J|-(0)s„ solve 



f + pf = 0 


with /( 0) 

= \m\,m = 14(0), i.e. u = l^( 0 )|c M + 1 4(0)v 


If 

U{t)> o for 0 < t < t, 

(4.5.3) 

then 

(J,j)fp > (J,J)U on [0, r] 

(4.5.4) 


! . I4*l)l . 1^2)1 ifn _ 

l£ Mh) s Mhr ao<t,£h<r 

(4.5.5) 


|J(0)|c #1 (t) + |J '(0)s M (t) < \ J(t)\ for 0 < t < T . 

(4.5.6) 
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We point out that the assumption (4.5.3), i.e. 

U(t)> 0 on (0, r) 

is indeed necessary. To see this, let M = S n (/i — e), J( 0) = 0; f^{t) then has 
a zero at t = J(t) one at t = ^==. In particular, for small positive e 

and any t which is only a little larger than y=j, we have < 1, and for 
example, (4.5.5) does not hold anymore. 

Proof. 

| J| + /r| J\ = j-yj (— (R(J, c)c, J) + /i(J, </)) 

+ Tyrjd^l 2 M| 2 — {J> j} 2 ) > 0 because K < n, 

Mr 

for 0 < t < r, provided J has no zero on (0, r). 

We then also have 

(\j[u-\j\ur = \j\-u-\j\u>o, 

since / M + /. t/ M = 0, provided / M (f) > 0. 

Because of | J|(0) = / M ( 0), |J|'(0) = / M (0), we conclude 

\J\fu - \J\U > o, 

i.e. (4.5.4). 

Next 

J m / n 

and from this and the initial conditions, we get (4.5.5). In particular, the 
first zero of J cannot occur before the first zero of Z^, and the preceding 
considerations are valid on (0,r). 

(4.5.5) implies (4.5.6). □ 

Corollary 4.5.1 Assume K < fa, > 0 on (0,r), and in addition either 
/i > 0 or J tan = 0. 

Furthermore, let ||c|| = 1, J(0) = 0, |J?| < A where R stands for the curvature 
tensor. 

Then 

\J{t)-tj{t)\<\J{r)\ l -At\ (4.5.7) 

Proof. Let P be a parallel vector field of length 1 along c, t G (0, r) 
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\(J(t)-tj(t),P(t)Y\ = \t(R(J,c)c,P)(t)\ 

< At\J{t)\ 

by (4.5.5) because of J(0) = 0 

< At\J(r)\, since c p > 0on[0,r]. 

Integrating this yields (4.5.7), as J( 0) = 0. □ 

We now want to study the influence of lower curvature bounds. It will 
turn out that this is more complicated than for upper curvature bounds. 

Theorem 4.5.2 Assume A < K < p. and either A < 0 or J tan = 0; ||c|| = 1. 
Moreover, let J(0) and J(0) be linearly dependent. 

Assume 

s § ( a+a 0 > 0 on(0,r). (4.5.8) 

Then for 0 < t < r 

|J(t)|<|J(0)|c A (t) + |Jr(0) SA (t). (4-5.9) 


Proof. Let p G R, p := max(/r — p, p — A). 

Let A be the vector field along c with 

A + pA = 0, A(0) = J(0), i(0) = J(0). (4.5.10) 

((4.5.10) is a system of linear 2nd order ODEs, and hence, for given initial 
value and initial derivative, there is a unique solution.) Let a : I — ■> R be the 
solution of 

a + (p — rf)a = rj\A\, a(0) = a(0) = 0, (4.5.11) 

and let b : I — > R. be the solution of 

b+ pb = r]\J\,b(0) = 6(0) = 0 (4.5.12) 

(since (4.5.11) and (4.5.12) are linear 2nd order ODE, too, again there exist 
unique solutions) . 

For each vector field P along c with ||P|| = 1, we then have by (4.5.10) 

\(j-A,py + P (j-A,p)\ = \(j + pj,p)\<n\j\ 

by choice of p and since J solves the Jacobi equation. 

Therefore, by (4.5.12) for d := (( J — A, P) — b)'s p — ((J — A,P) — b)s p 

d=((J - A, P) - b)-s p - (( J — A, P) — b)s p < 0, 

and hence, if s p > 0 on (0, t], because d( 0) = 0 
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(I«j - A ,P)~ = §^<0. (4.5.13) 

S p S p\ l ) 


Note that ( J — A, P) — b has a second order zero at t = 0, and hence 
({J — A, P) — b) vanishes for t = 0. 

Therefore, we obtain from (4.5.13) 


— (( J — A, P) — b) <0 on (0, r) . 

Sp 

(4.5.14) 

If s p > 0 on (0, r), this implies 


J — A < b on (0, r) 

(4.5.15) 

and by (4.5.12) then 

b+ (p — r])b< rj\A\. 

(4.5.16) 

From (4.5.12) and (4.5.16) we conclude with the same argument 
leading to (4.5.14), 

( b — a) < 0, 

s p-v 

as the one 

i.e. 

b < a 

(4.5.17) 

provided 

s p _ v > 0 on (0, r). 


From (4.5.15) and (4.5.17) 

\J-A\ < a. 

(4.5.18) 

Now by (4.5.10) 

((A,A)(A,A)- (A,A)(A,A)y = 0 

(4.5.19) 

and hence 

{A, A) (A, A) -(A, A) (A, A) =0, 

(4.5.20) 


because this expression vanishes for t = 0, since A(0) = J(0) and ,4(0) = J(0) 
are linearly dependent by assumption. This implies 


14 + p\A\ — 0, 


i.e. putting 


/ ct = |J(0)|c ct + |J|'(0K (4.5.21) 

we have 

14 = f P ■ (4-5.22) 

This implies in turn in conjunction with (4.5.11) 

a = fp—V ~ f p- 


(4.5.23) 
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(4.5.18), (4.5.22), (4.5.23) yield 

\J\ 5 : fp—rp 

Putting p = \(p + A), i.e. p — rj = A, we get (4.5.9). (Note that then p = 
\{p — A) >0, and hence s p > 0 implies s p - v > 0 on (0,r).) □ 

Theorem 4.5.3 Suppose ||c|| = \,\K\ < A. Let J(0) and j(0) be linearly 
dependent. Let Pt denote parallel transport along c from c(0) to c{t). 

Then 

| J(t) - P t (J( 0) + t j(0))| < |J(0)|(cosh(^4t) - 1) (4.5.24) 

+ | J|'(0)(— != sinh(\/]4t) - t). 

\/A 


Proof. From (4.5.20) 


h'=°- 


This means that ^ is a parallel vector field. In the proof of Theorem 4.5.2, 
we now put p = 0. We then get |A| = p 0 (cf. 4.5.22)), i.e. 

A(t) = P t (J(0) + tj(0)). 

With p = 0, we have ry = A, and hence s p and s p - v > 0 for t > 0, as required 
in the proof of Theorem 4.5.2. (4.5.18) and (4.5.23) then yield the claim. □ 

Remark. If we do not assume ||c|| = 1, in all the preceding estimates, t has 
to be replaced by t||c|| as argument of s T ,c T ,f T etc. 

Namely, let 

be the reparametrization of c by arc length, i.e. ||c|| = 1. 

Then 

is the Jacobi field along c with J( 0) = J(0), J(0) = ^pjp namely, since J 
satisfies the Jacobi equation, J satisfies 

J + R{J, c)c = 0. 

Thus, estimates for J yield corresponding estimates for J. 


Remark. The derivation of the Jacobi field estimates of the present para- 
graph follows P. Buser and H. Karcher, Gromov’s almost flat manifolds, 
Asterisque 81, 1981. 
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Perspectives. The Rauch comparison theorems are infinitesimal comparison re- 
sults for the geometry of a Riemannian manifold in terms of the geometry of spaces 
of constant curvature. 

A global comparison result is Toponogov’s theorem: 

Let M be a Riemannian manifold with sectional curvature K > A. Let A 
be a triangle in M with corners p, q, r and distance minimizing geodesic edges 
c pq , c qr , c pr . Then there exists a geodesic triangle Ao in the simply connected space 
M\ of curvature A with the same side lengths as A and with angles at its corners 
not larger than the ones of A at the corresponding corners. In case A > 0, we have 
in particular 

L(dA) < 


4.6 Geometric Applications of Jacobi Field Estimates 


We first recall Corollary 4.2.2: Let c(t) = exp p tc(0) be geodesic, w € T p M, 
J the Jacobi field along c with J( 0) = 0, j(0) = w. J(t) then yields the 
derivative of the exponential map 

J(t) = (Ac( 0 ) exp p )(tw). (4.6.1) 

We obtain 


Corollary 4.6.1 Let the sectional curvature of M satisfy X < K < fi. Fur- 
thermore, let (w,c(0)} = 0. Then, provided t||c(0)|| < ^ in case p > 0, 


aWIc(Q)II) 

t\\c 


< l(Ac(o) exp p )H < \w\ 


^(t||c(0)||) 

t||c(0)|| 


(4.6.2) 


(Of course, if w is a multiple of c( 0), we have (D t c(p)exp p )w = w.) 


Proof. For ||c(0)|| = 1, this follows from (4.5.6) and (4.5.9). 

We now put c(t) := exp p t ||^o]|| • c thus is a reparametrization of c, and 

||c|| = 1. Let J be the Jacobi field along c with J( 0) = 0, J(0) = w. Finally, 


(Ac(0) exp p){tw) 


1 

Mi 


(D 


i||c(0)||5(0) 


1 

mm 


J(*||c(0)||), 


exp p )(t||c(0)||w) 


and J(t||c(0)||) is controlled by s M (t||c(0)||) and sa(<||c(0)||) from below and 
above, resp. □ 


Theorem 4.6.1 Let the exponential map exp p : T p M M be a diffeomor- 
phism on {v £ T p M : ||i>|| < p}. Let the curvature of M in the ball 
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B(p,p) := {q G M : d(p,q ) < p} 


satisfy 

and suppose 

A < K < p,, withX < 0, p > 0, 

7T 

p < in case p > 0. 

(4.6.3) 

Let r(x) := d(x. 

2^/Jl 

,p),k(x) := 7jd 2 (x,p). Then k is smooth on B(p,p) 

and sat- 

is fie s 

grad k(x) = — exp“ p 

(4.6.4) 

and therefore 

| grad k(x)\ = r(x). 

(4.6.5) 

y/~jlr(x) ctg (^/Jir(x))\\v\\ 2 < X7dk(v,v) 

(4.6.6) 


< V—X r(x) ctgh (V— Ar(x)) |u | 2 



for x G B(p, p),ve T X M. 


Proof. We have 

gradfc(:r) = — exp” 1 ^, 

because the gradient of k is orthogonal to the level surfaces of k. and those 
are the spheres S(p,r ) := {<7 G M : d(p,q) = r} = exp p {u G T p M : ||n|| = 
r} ( r < p)\ in particular, the gradient of k has length d(x,p ), proving (4.6.5). 

The Hessian X/dk of k is symmetric, and can hence be diagonalized. It 
thus suffices to show (4.6.6) for each eigen direction v of Xdk. Let y(s) be 
the curve in M with y(0) = £, Y(0) = v. 

c(t,s) := exp 7(s) (texp“ 1 s) p), (4.6.7) 

in particular c(0, s) = j(s), c( 1, s) = p. 

Then by (4.6.4) 

d 

(grad fc)(y(s)) = ~—c(t,s) | t=0 , 

hence 

d 

(V t , grad k){x) = -V a_ — c(t, s)| t =o, s =o 

= -V^^c(t,s) | t=0(g=0 (4.6.8) 

J(t) = f^c(t,s) | s=0 is a Jacobi field along the geodesic from x to p with 
J(0) = 7(0) = v, J(l) = 0 G T p M (by (4.6.7)). (4.6.8) thus implies 


V„gradfc(a;) = — j(0) 
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i.e. 

Vdk(v,v) — (V t , gradfc,v) = — ( j(0), J(0)). (4.6.9) 

Since v is an eigen direction of X7dk, V„gradfc and v, i.e. J(0) and J(0) are 
linearly dependent. (4.5.6) and (4.5.9) imply for t = 1 (J(l) = 0) (recall the 
remark at the end of § 4.5) 

\v\ c n(r(x)) + | J|'(0)s M (r(a;)) < 0 < |w|c A (r(a;)) + | J|'(0)s A (r(a;)) 
and with (4.6.9), this gives (4.6.6). □ 

We want to briefly describe the relation between Jacobi fields and the 2nd 
fundamental form of the distance spheres 

dB(p, r) = {q& M, d(p , q) = r}. 

Assume the hypotheses of Theorem 4.6.1; in particular, assume that exp p is 
a diffeomorphism of {||f|| < p} onto B(p,p), and that r < p. 

We have 


N(x ) = gradfc(x) = — exp^p (by (4.6.4)); (4.6.10) 

where N(x) is the exterior normal vector of the distance up here containing 
x. For the second fundamental form S of the distance sphere and for X 
tangential to this sphere, we then have 


S(X, N ) = V^IV (since N(x) has constant length r 
on dB(p , r) the part of X x N 

normal to dB(p,r) vanishes) (4.6.11) 

= X/ x grad k. 

We now obtain a diffeomorphism from dB{p 1 r) onto dB{p 1 r + t) (assuming 
r + t < p) by 

E t (x) := exp x tN(x) (x € dB(p,r)). 

Let y(s) be a curve in dB(p,r) with y(0) = f,7(0) = x. Then 

J(t) = ^ J Bi(7(s))| s=0 (4.6.12) 

is a Jacobi field along E t (x) with 
J(0) = 7(0) = v 

J( 0) = V_e ^exp 7(s) (tAT(7(s)))= V^^exp T(s) (flV(7(s)))|t=o 

= V a_fV(7(s))| s= o 
= S(v,N) = S(J(0),N). 

Since E t ("/(s)) is a curve in dB(p,r + <), we likewise have 
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j(t) = S(J(t),N). 


(4.6.13) 


We put S t {-) = S{-,N{t)). 

From this, we get 

m = v.(s t (j(t)) = St(j(t)) + s t (j(t)). 

dt 


The Jacobi equation J + R(J, N)N 


0 thus implies a Riccati equation for 


S t - 


St(-) = -R(-,N)N - S t O $(■). (4.6.14) 


Thus, on one hand, (4.6.13) describes the geometry of distance spheres 
through Jacobi fields. On the other hand, solutions of the Riccati equation 
satisfy a 1st order ODE and hence are easy to estimate, and from such es- 
timates one may then obtain Jacobi field estimates. In order to explain this 
last point, let P be a vector field parallel along E t (x) with ||P|| = 1. Then 


(St(P),Py = ~(R(P,N)N,P) - (<S 2 (P), P). (4.6.15) 

Since the 2nd fundamental tensor is symmetric, 

(S 2 (P),P) = (S t (P),S t (P)) (cf. Lemma 3.4.2). (4.6.16) 


We put £(•) = prjj-St(-). Since all expressions in (4.6.15) are quadratically 
homogeneous in ||-/V||, we obtain 


(£(P),Py 


<w iwiVir 

TV N 

- _( (P ’M M’ 


P)-(£(P),£(P)} 

P)-(JC(P),P) 2 . 


(4.6.17) 


If the sectional curvature satisfies A < K, because of ||P|| = 1, 

V 5 ;= (Z(P),P) 

then satisfies the differential equation 

— (4.6.18) 


Now 

ct\(t) := 

satisfies the differential equation 


sx(t) 

s\{t) 


c\{t) 

sx(t) 


cix = -A - ct\, 


and it easily follows that 

ip(t) < ct\(t ) , provided ip(s) > — oofor alls with 0 < s < t. 
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With (4.6.13), we conclude from this for a Jacobi field J along E t with J(0) = 

(\m\ 


(4.6.19) 


i /,\ I (t) — o? 

J 

provided in (0, t\ there is no point conjugate to 0. 

In particular 

|7(t)| < m0)ax(t), 

i.e. a special case of (4.5.9), up to the first conjugate point. 


Perspectives. Let M p be the simply connected space form of curvature p. Let 
V p (r) denote the volume of a ball in M p with radius r. Let M be a Riemannian 
manifold, p £ M,r < i(p) (= injectivity radius of p) (i.e. B(p,r) is disjoint from 
the cut locus of p). We then have the volume comparison theorems of R. Bishop: 

If R.ic(AL) > Ric(Mp), then 

Vol (B(p,r)) < V p (r) 


and P. Gunther: 

If K(M) < p ( K is the sectional curvature), then 
Vol(B(p,r)) > V p (r). 

These estimates are also proved with the help of Jacobi field estimates. 


4.7 Approximate Fundamental Solutions and 
Representation Formulae 

Lemma 4.7.1 Suppose exp p : T p M — > M is a diffeomorphism on the ball 
{v £ T p M : ||i>|| < p}, and suppose the sectional curvature in B(p, p) satisfies 

X < K < p with A < 0, /i > 0, 

put A := max(— A,/i), and assume 

7T 

p < —— in case p > 0. 

Vs 

Then, with r( x) = d(x,p), for x ^ p 

\A\ogr(x)\ < 2A ifn = dimM = 2 (4-7.1) 

Y) 2 

|Z\(r(cc) 2_rl )| < — - — Ar 2 ~ n (x) ifn = dimM > 3. (4.7.2) 


Proof. We prove only (4.7.2) as (4.7.1) is similar. 
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—Ar{x) 2 ~ n = - A(d 2 (x,p .))V 

= 2 -^(-^)(d 2 (x,p))-^\\gradd 2 (x,p )\\ 2 

Now by Theorem 4.6.1 

||gradd 2 (a:,p)|| 2 = 4d 2 (:r,p), 

2n(l — pr 2 ( x)) < —Ad 2 (x,p) < 2n(l — A r 2 (x)) 

(noting — A = trace Vd), 

and (4.7.2) follows. □ 


Lemma 4.7.2 Suppose B(p, p) is as in Lemma 4-7.1. Let u n be the volume 
of the unit sphere in R", n = dim M. For h £ C 2 (B(p, p), R) then (with A as 
in Lemma 4-7-1) 


ifn 


2 | uj 2 h(p) 


I ^h) log r M 
J P 

B(p,p) 


1 

P 


J h\ <2/1 J \h\ (4.7.3) 


dB(p,p) 


B(p,p) 


</n>3|(n-2KA(p)- J ( Ah )( r ^ n _ 2 ~ 

B(p,p) 


l kl 

dB(p,p) 


< 



B{p,p) 


\h\ 

r(x) n ~ 2 


(4.7.4) 


Proof. We prove only (4.7.4) as (4.7.3) is similar. 
We put 

9(x) := r(x) 2 ~ n - p 2 ~ n 

Then for e > 0 


J ( gAh - hAg) = 


(h grad g — g grad h, dl7) . 


B(p,p)\B(p,e) d(B(p,p)\B(p,s )) 

{A? denotes the outer unit normal of d(B(p, p)\B(p, £■)).) 
Now 
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/ ^ by(4J ' 2) 
B(p,p)\B(p,e) B(p,p ) 

J h(gradg,dT?) = J h 

dB(p,p) dB(p,p) 

lim / g(gradh 1 dl?) = 0 
£ — ^0 J 

dB(p,e ) 

lim / (gradg,cfi7) = — (n — 2)ui n h(p), 

£ — ^0 J 

dB(p,e) 

and (4.7.4) follows. □ 

For the interpretation of the preceding formulae, we observe that in the 
Euclidean case 

Z\r(a;) 2 ^ ra = 0 for a; 7^ p, (4.7.5) 

whereas individual second derivatives of r(x) 2_ " grow like r(x)~ n for x — > p. 
Therefore, in the Riemannian case, although (4.7.5) is not an identity any- 
more it holds up to an error term which gains two orders of magnitude against 
the crude growth estimate r(x)~ n . The same holds for the representation for- 
mulae in Lemma 4.7.2. The error terms on the right hand side are two orders 
better than the other integrands. 

Perspectives. The results of this paragraph are from [141]. Extensions of these 
results can be found in [126]. 


4.8 The Geometry of Manifolds of Nonpositive 
Sectional Curvature 

In this §, we shall present some results that apply to compact or complete 
Riemannian manifolds of nonpositive sectional curvature. It is very instruc- 
tive to see how strongly an infinitesimal geometric condition, namely that 
the sectional curvature is nonpositive, influences the global geometry and 
topology of the manifold in question. 

At one place, we shall refer to a subsequent Chapter for a proof ingredient. 
This is done for the sake of conciseness although the result in question can 
also be given an elementary - but not entirely trivial - proof with the tools 
already developed, and an ambitious reader may wish to find such a proof. 

From § 4.6, we obtain 
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Lemma 4.8.1 Let N be a Riemannian manifold with sectional curvature 
< 0. Let p £ M. Then the exponential map 

exp p : T p N — > N 

has everywhere maximal rank. 

Furthermore, for 

k(x) := -d 2 (x,p) 

if ex p p is a diffeomorphism on the ball B(p,p),x £ B(p,p),v £ T X N , we have 

X7dk(v,v) > ||u|| 2 (4.8.1) 

Proof. Corollary 4.6.1 and Theorem 4.6.1. □ 

These are local results. We shall now state a fundamental global result: 

Theorem 4.8.1 Let N be a complete Riemannian manifold of nonpositive 
sectional curvature, p,q £ N. Then in any homotopy class of curves from p 
to q, there is precisely one geodesic arc from p to q , and this arc minimizes 
length in its class. 

Proof. There exists a sequence (7 n ) of curves from p to q with 

lim L(pf n ) = r := inf {lengths in given homotopy class} 

n — >-oo 

(L denoting length). 

W.l.o.g., for all n 

7 n C B(p, r + 1) 

in particular 

7 n n B(p, r + 2 )\B(p, r + 1) = 0. 

The proof of Theorem 1.4.6 therefore works with B(p, r + 1) instead of the 
Riemannian manifold M considered there to show the existence of a shortest 
geodesic arc 7 from p to q in the given homotopy class. 

To show uniqueness, we first observe that by Theorem 4.1.1, every 
geodesic arc 7 from p to q is a strict local minimum of energy among all 
arcs with endpoints p and q, because Ij(W, W) > 0 for all W ^ 0 with 
W{p) = 0 = W{q). (Here, W is a section along 7. The index form J 7 was 
defined in (4.1.8).) 

Let now 7, : [0, 1] — » N, i = 1, 2, be homotopic geodesic arcs from p to q, 
with 71 ^ 72, and let 

r - [0,1] x [0,1] -*n 


be a homotopy, i.e. with 
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r{t, 0) = 7i(t),r(t,l) = 72 (t) for all t 
T( 0, s) = p, r( 1, s) = q for all s 

Let 

i?:= max E(r(-,s)). (4.8.2) 

sS[0,l] 

As in Theorem 6.11.3 below, one shows that there exists another geodesic arc 
73, different from 71 and 72, with 

max(if(7 1 ), 2S(7 2 )) < £(73) < R- (4.8.3) 

Again, by Theorem 4.1.1, 73 is a strict local minimum of E, and so, replacing 
e.g. 72 by 73 in the previous argument, we obtain a fourth geodesic arc 74 
with 

E( 73) < #(74) < R. 

(It is not hard to see from the proof of Theorem 6.11.3, that 73 may be 
connected with 71 or 72 through arcs of energy < R so that the maximum 
in (4.8.2) will not be increased.) We therefore obtain a sequence (7n)neN of 
geodesic arcs from p to q with 

R(7n) < R for all n. 

Let 7 n (t) = exp p tv n with v n € T p N, ||u„|| 2 < 2 R. After selection of a sub- 
sequence, (u„) ne N converges to some v £ T p M with ||u|| 2 < 2 R. Since all v n 
are different from each other, but exp p u„ = q for all n, exp p cannot have 
maximal rank at v. This is a contradiction, since by Lemma 4.8.1, the expo- 
nential map of a manifold of nonpositive curvature has everywhere maximal 
rank. Thus, 71 =72, proving uniqueness. □ 

As a corollary, we have the following result of Hadamard-Cartan 

Corollary 4.8.1 Let Y be a simply connected complete Riemannian man- 
ifold of nonpositive sectional curvature. Then Y is dijfeomorphic to R" 
(n = dimy), and such a diffeomorphism can be obtained from the exponential 
map 

ex Pp : T p Y(= IT) -4 Y 

of any p £Y. This exponential map is distance nondecreasing, i.e. 

||u — u>|| < d(exp p v, exp p w) for all v, w £ T p Y. 


Proof. Theorem 4.8.1 implies that for every p,q £ Y, there exists precisely 
one geodesic arc from p to q because there is only one homotopy class of such 
arcs as Y is simply connected. One easily concludes that for every p £ Y, 
exp p : T p Y — > Y is injective and surjective. (It is defined on all of T p Y because 
Y is complete.) Since it is of maximal rank everywhere by Lemma 4.8.1, it 
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follows that Y is diffeomorphic to T p Y. The distance increasing property of 
the exponential map follows from Corollary 4.6.1. □ 

Lemma 4.8.2 Let Y be a simply connected complete manifold of nonpositive 
curvature, p £ Y. Then, with k(x) = \ d 2 (x,p ), for every v € T X Y, x £ Y 

Vdk(v,v) > |H| 2 - (4.8.4) 


Proof. From Corollary 4.8.1 and Lemma 4.8.1. 
We also have 


□ 


Theorem 4.8.2 Let eft) and 02 (f) be geodesics in Y, a simply connected 
complete manifold of nonpositive sectional curvature. Then 

d 2 (ci(t),c 2 (t)) 


is a convex function of t. 


Proof. Since the geodesic arc from eft) to eft) is uniquely determined by 
Theorem 4.8.1, it depends smoothly on t. Hence d 2 (c\(t), 02 ft)) is a smooth 
function of t. For each t, we denote this geodesic arc from eft) to C 2 (t) by 
7 (s,t), with s the arc length parameter. Then 

d 2 (c 1 (f),c 2 (t)) = 2E( 1 (;t)). (4.8.5) 


Now by Theorem 4.1.1 (exchanging the roles of s and t in that theorem) 

d(ci(t),c 2 (t)) 

^£(7 (•,*))= J (V_a J^7(M),V_a ^p(s,t))ds 


d(ci(t),c 2 (t)) 




ds dt dt. ds 


(4.8.6) 


where R denotes the curvature tensor of Y. Since Y has nonpositive sectional 
curvature, (4.8.6) implies 


and with (4.8.5) the claim follows. 


□ 


A reformulation of the preceding result is 

Corollary 4.8.2 Let Y be a simply connected complete manifold of nonpos- 
itive (sectional) curvature. Then 
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d 2 : Y X Y -> R. 

is a convex function. (Note that here, d 2 is considered as a function of two 
variables.) 

Proof. According to Definition 3.3.6, we have to show that the Hessian of d 2 
is positive semidefinite. 

By (3.3.50), we have to compute the second derivative of d 2 along 
geodesics in Y x Y. Such geodesics c are given as (ci,c 2 ) where C\, c 2 are 
geodesics in Y. We thus have to show that d 2 {c\{f), c 2 (<) is a convex function 
of the arc length parameter t. This is Theorem 4.8.2. □ 

Remark. On a not necessarily simply connected Riemannian manifold N 
of nonpositive sectional curvature, the results of Lemma 4.8.2 and Theorem 
4.8.2 hold locally: 

If 

exp p : T p N — > N 

is a diffeomorphism on the ball {i> £ T. p N : ||i>|| < p) C T p N for some p > 0, 
then (4.8.4) holds for x £ B{p , p ) C N, and d 2 is convex on B(p, p) x B{p , p), 
i.e. for any geodesics Ci,c 2 : [0,1] — > B(p,p), d 2 {c\(t), c 2 (f)) is a convex 
function of t. 

Building upon Lemma 4.8.2, we shall now derive some quantitative ver- 
sions of the preceding convexity results 

Lemma 4.8.3 As always in this §, let N be a Riemannian manifold of non- 
positive sectional curvature, p £ N , and suppose that 

exp p : T p N — > N 

is a diffeomorphism on the ball {t; £ T p N : ||u|| < p} (here, p > 0, and if N 
is complete and simply connected, we may take p = oo by Corollary 4-8.1). 
Then 


d 2 {p,l(t)) <(l-i)d 2 (p,7(0)) + ^(P>7(l)) (4.8.7) 

-*( 1-^ 2 (7(0)),7(1))- 


Proof. Let fco : [0, 1] — > R be the function with 

fc o (0)=d 2 (p,7(0)), fco(l) = d 2 (p, 7 ( 1 )), k'^t) = 2\\ 1 \t)\\ 2 . 

Then 

d 2 (p, j(t)) < k 0 

as a consequence of (4.8.4). Since 
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k 0 (t) = (1 -f)fc 0 (0) + ffc 0 (l) - f(l - f)d 2 (7(0),7(l)) 

(note ||7 / (t)|| = ^(7(0), 7(1))), the claim follows. □ 

Corollary 4.8.3 Under the assumptions of Lemma 4-8.3, let 71,72 : [0, 1] — > 
C N be geodesics with 

7i(0) = p = 7 2 (0). 


Then, for 0 < t < 1 

d(7i(t),72(t)) < td( 7i(1),7 2 (1)). (4.8.8) 


Proof. Applying (4.8.7) to 71 (1) in place of p, 72(f) in place of 7 (t), 

d 2 (7i(l), 72(f)) < fd 2 (7i(l),72(l)) + (1 - t)d 2 { 7i(l),p) 

- f(l - f)d 2 (7 2 (l),p) 

Applying (4.8.7) to 72(f) in place of p, 71(f) in place of 7(f) 

d 2 ( 7 i(t), 72(f)) < td 2 (71(1), 72(f)) + (1 -f)d 2 (p, 72(f)) 

- f(l - t)d 2 { 7i(l),p) 

Noting d 2 (p, 72(f)) = f 2 c? 2 (p, 72(1)) and inserting the first inequality into the 
second one yields the result. □ 

Remark. It is also easy to give a direct proof of Lemma 4.8.3 based on the 
Jacobi field estimate (4.5.5). 

We now come to Reshetnyak’s quadrilateral comparison theorem: 

Theorem 4.8.3 As in the preceding lemma, let 

exp p : T p N — > N 

be a diffeomorphism on the ball of radius p in T p N, N a Riemannian manifold 
of nonpositive sectional curvature. 

Let 

7i,72 : [0, 1] -> B(p,p) C N 

be geodesics. For 0 < t < 1, and a parameter 0 < s < 1 then 

d 2 {l\ (0), 72(f)) + d 2 (7i(l), 72(1 -f)) 

< d 2 (7i (0), 72(0)) + d 2 (7i(l), 72(1)) + 2f 2 d 2 (7 2 (0), 72(1)) (4.8.9) 

+ t(d 2 { 7i(0),7i(l)) - d 2 (7 2 (0), 72(1))) 

- fs(d(7i(0),7i(l)) - d(7 2 (0),7 2 (l))) 2 

- f(l - s)(d(7i(0), 72(0)) - (i(7i(l),7 2 (l))) 2 . 
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Note that this inequality is sharp for certain quadrilaterals in the Euclidean 
plane. 

Proof. We first consider the case t = 1, s = 0. For simplicity of notation, we 
define 

:= d(7i(0),7i(l)), *=1,2, 

h := d(7i(0), 72(0)), b 2 := d(7i(l), 72(1)), 

di := d(7 2 (0),7 1 (l)), d 2 := d(7i(0), 72(f))- 



Also, we let 5 : [0,1] — > B(p, p) C N be the geodesic arc from 71 (0) to 72(1), 
as always parametrized proportionally to arclength. Its length is d 2 . We also 
put for 0 < A < 1 

d' x :=d( l2 (0),6(\)), d'{:=d( 7 i(1),5(A)). 

Then by (4.8.7) 

d% < (1 - A )b{ + A a\ - A(1 - A )d\ 
d'x < \b\ + (1 - A)of - A(1 - A )d\. 

Therefore, for 0 < e, 

d\ < {d'\ + d'f) 2 < (1 + e)d' 2 + ^1 + d " 2 

< (1 + e)(l - A )bf + (1 + e)Aa2 

+ (l + ^ A b\ + (l + ^ (1 - A )ai 

- ^2 + e+Jj A(l-A)^. 

We choose e = so that the coefficient in front of d \ becomes 1. This 
yields 

dl + d\ < al + al + ^ b\. 

A 1 — A 
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With 

A= _ & 1_ 
b 1 + b 2 ’ 

we obtain 

d 2 T d 2 — a i T a 2 T b 2 T b 2 — (pi — b 2 )~ . 

This is the required inequality for t = l,s = 0. For symmetry reasons, we 
also obtain the inequality for t = 1, s = 1, namely 

d 2 T d 2 ^ Cl^ T CL 2 T b 2 T 62 — (&1 — U‘2 ) 2 , 

and taking convex combinations yields the inequality for t = l, 0 <s<l: 

d 2 + d\ < a 2 + 03 + b\ + 62 — s(a 1 — a 2 ) 2 — (1 — s)(6i — & 2 ) 2 . (4.8.10) 

We therefore obtain the inequality for 0 < t < 1 from (4.8.7) and (4.8.10) 

d 2 (7i(0),72(*)) + d 2 (71 (1), 72(1 - t )) 

< (1 — f)6 2 + td^ — f(l — t)a\ + (1 — t)b\ + td\ — t{ 1 — t)a\ 

<b\ + b\ + 2t 2 a 2 — t{a\ — a 2 ) — ts(ai — a 2 ) 2 — f(l — s)(&i — 6 2 ) 2 . 

□ 

Theorem 4.8.3 allows us to derive the following quantitative version of 
the convexity of the distance between geodesics. 

Corollary 4.8.4 Let 71,72 : [0,1] — > N be geodesics as in Theorem 4-8-3. 
Then we have for 0 < t < 1, 0 < s < 1 

d 2 ( 7iW, 72(*)) < (l-^)d 2 (7i(0),72(0)) + td 2 (7i(l),7 2 (l)) (4.8.11) 

- t( 1 - t){s(d(7i(0),7i(l)) - d(7 2 (0),7 2 (l))) 2 
+ (1 - s)(d(7i(0),7 2 (0)) - d(7i(l),7 2 (l))) 2 }. 


Proof. We shall show the inequality for t = It is then straightforward to 
deduce the inequality for arbitrary t. 

We keep the notations of the preceding proof, and we also put 


ei := d(7i(0),7 2 (^)), e 2 := d(7i(l), 72^)). 


Then by (4.8.7) 


By (4.8.8) 


d 2 (7i(^),72(^)) < \e\ + ^ e% 




-s(ai-a 2 ) 2 


\( l - s)(bi - b 2 ) 2 . 
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Thus 

d 2 (7i(^),72(^)) < ^ b\ + - |s(ai - a 2 ) 2 - |(1 - s)(6i - 6 2 ) 2 

which yields the inequality for t = □ 

As an application of Theorem 4.8.3, let us consider the following Pythago- 
ras inequality 

Corollary 4.8.5 Let the assumptions of Lemma 4-8.3 hold. 

Suppose 

d(l(0),p) = min^d(j(t),p) 

(i.e. 7(0) is the point on 7 closest to p). Then 

d 2 {l{s) 1 p) > d 2 (7(0),p) + s 2 d 2 (j(0),j(l)) for 0 < s <1. (4.8.12) 


Proof. It suffices to treat the case s = 1. 

By (4.8.7), 

d 2 (l(t),p) < (1 - f)d 2 (7(0),p) + td 2 (7(l),p) - t( 1 - f)d 2 (7(0),7(l)). 
Since by assumption 

d 2 (7(0),.p) < d 2 (7(t),p), 

we get 

t.d 2 {j{l),p) > td 2 (7(0),p) + td 2 (7(0),7(l)) - f 2 d 2 (7(0),7(l)). 
Dividing by t and letting t — > 0 yields the desired inequality. □ 

We now turn to Karcher’s center of mass constructions and their applica- 
tions. While such constructions are meaning- and useful under more general 
conditions, here we only consider nonpositively curved manifolds, because in 
that case, the geometry is most favorable to them. 

Thus, let Y be a complete, simply connected, nonpositively curved Rie- 
mannian manifold. We recall that by Corollary 4.8.1, exp p : T p Y — > Y is a 
global diffeomorphism. This will be used implicitly below at several places. 
Let p be a probability measure on Y , i.e. a nonnegative measure with 

P{Y) = J dp= 1 . 

Definition 4.8.1 q £ Y is called a center of mass for p if 

J d 2 (q, y)dp(y) = mf J d 2 (p,y)dp(y) < 00 (4.8.13) 
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In the sequel we shall always assume that the infimum in (4.8.13) is finite. 
This is satisfied if, for example, the support of the measure p is bounded. 

Examples: 

1) If /x is a Dirac measure S q supported at q € Y, then q is its center of 
mass. 

2) If p = \{8 qi + S q2 ) for gi,<? 2 G Y, then the center of mass is y(|) 
where 7 : [0, 1] — > Y is the unique geodesic from <71 to q 2 - 

Lemma 4.8.4 

F (P ) ; = \ J d 2 {p,y)dp{y) 

is a differentiable function of p, with 

grad F(p) = - J exp ~ 1 {y)dp(y) (here, ex p” 1 : Y -> T p Y (4.8.14) 

is considered as a vector valued function.) 
Thus, q is a center of mass of p if 

J exp ~ 1 (y)dp(y) = 0. (4.8.15) 

Proof. (4.8.14) follows from (4.6.4). Thus, F is differentiable, and a minimizer 
has to satisfy grad F(p) = 0, i.e. (4.8.15). □ 

We now use the nonpositive curvature of Y in an essential manner: 

Lemma 4.8.5 

Pip) = \ J d 2 {p,y)dp(y) 

is a strictly convex function of p. 

Proof. From Lemma 4.8.2 by integration, because p is nonnegative. □ 

We deduce 

Theorem 4.8.4 There exists a unique center of mass for p, i.e. a unique 
q GY with 

J d 2 (q,y)dp(y) = mf J d 2 {p,y)dp{y) 


Proof. This follows from the strict convexity and the fact that F(p) is coer- 
cive, i.e. 
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F {jpn ) — ► oo if d 2 (p n ,po) — > oo for some fixed po 

and a sequence (p n )neN C Y. 


□ 


Remark. Up to this point, we have not used the normalization 

M- n = i- 

Thus, Theorem 4.8.4 holds for any nonnegative measure (provided the in- 
fimum in (4.8.13) is finite, of course). This will be applied in § 8.4 below. 
The subsequent estimates, however, will use this normalization; without that 
normalization, additional factors will occur. 

Lemma 4.8.6 Let q be the center of mass of p. Then for every p GY 

d(p,q) < || grad F(p)|| (4.8.16) 

and for every v € T q Y , \\X7 v grad F(g)|| > ||u|| (4.8.17) 

Proof. Let 7 : [0, 1] — > Y be the geodesic from q to p. 

Thus, 

\\j(t)\\ = d{p,q) for alii G [0,1]. 

We have 

( grad F(p), 7(1)) = - J (exp" 1 y, j(l))dp(y) 

= ~ J j t (^P^ t) y,i(t))dt^ dp{y) 

- J (ewf 1 y,i(Q))dp(.y). 

The last integral vanishes by (4.8.15), since q is the center of mass for p. By 
the proof of Corollary 4.6.1, since Y has nonpositive curvature (and since 
Dd_' y(t) = 0 as 7 is geodesic) 

— ( exp 7 ( t) Y > tW) > IliWII 2 - 

Thus || grad F(p)\\d(p, q) > ( grad F(p), 7(1)) > d(p,q) 2 which implies 
(4.8.16). (4.8.17) is the infinitesimal version of (4.8.16) (of course, (4.8.17) 
can also be derived directly from the proof of Corollary 4.6.1). □ 

Lemma 4.8.7 Let pi,P 2 be two probability measures on Y, with centers of 
mass qi,q 2 resp. Then 
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d(qi,q 2 )< J d{q 2 ,y)\dp l - dp 2 \{y) (4.8.18) 

Proof. By (4.8.16), with F.fp) = \j d 2 {y,p)dm{y), for i = 1,2, 
d{qi,q 2 ) < || grad Fi(g 2 )|| 

< | j exp ~ 2 1 (y)dp 1 (y)\ 

= 1 J {expf 2 1 y){dy 1 - dfi 2 )(y)\ since grad F 2 {q 2 ) = 0. 

We use | exp” 1 y\ = d(q 2 , y) to get (4.8.18). □ 

We now consider the situation where 

p = f*v, 

for some measurable map / : A — ► Y for a set A with a probability measure 
v. 

Then 

J d 2 (q,y)dp(y) = J d 2 {q, f{x))dv(x). (4.8.19) 

For the moment, v will be fixed, and so we shall call a minimizer a center of 
mass for the map /. 

Lemma 4.8.8 Let fi, f 2 - A — > Y be measurable maps with centers of mass 
q\ 1 q 2 , resp. Then 

d(qi,q 2 )< j d{f 1 {x) 1 f 2 (x)) dv{x) (4.8.20) 

Proof. By Lemma 4.8.6 and (4.8.14) 

d(qi, qi) < J exp” 1 f ± (x) du(x) 

= J (exp^ 1 fi(x) - exp” 1 f 2 (x)) dv{x) 

because q 2 is the center of mass for f 2 

< J d{fi(x),f 2 (x)) dv(x ), 


because the exponential map into a space of nonpositive curvature is distance 
nondecreasing by Corollary 4.8.1. □ 
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Corollary 4.8.6 Let f : A —> Y be measurable with center of mass q. Then, 
for all x & A, 

d(f(x),q)< j d{f{x),f{y)) dv(y) (4.8.21) 

Proof. We consider the map fi(y) = f{y) and the constant map f 2 (y) = /(x), 
for all y € A- the former has center of mass q, the latter center of mass f(x). 

We apply (4.8.20). □ 

The next result will be applied in § 8.9 below only: 

Corollary 4.8.7 Let f\\ (Ai,vi) — > Y, f 2 : {A 2 ,V 2 ) —> Y be measurable maps 
from probability measure spaces into Y. 

Let qi,q 2 be the corresponding centers of mass. 

Let (p : {Ai, v\) — > (A 2 , v 2 ) be measurable, with f 2 = fi o ip. 

Then 

d(qi,q 2 )< J d(fi(x),f 2 (<p(x))) dvifx) (4.8.22) 

+ [ d(f 2 (x),q 2 )\du 2 — ip*dvi\(x). 


Proof. Let q 2 be the center of mass for f 2 oip w.r.t. 

By Lemma 4.8.8 

d(qi,q 2 ) < J d(fi(x),f 2 o<p(x)) dvi(x). 

By Lemma 4.8.7, since q 2 is the center of mass for f 2 w.r.t. 

d(q 2 ,q 2 )< J d(f 2 (x),q 2 )\dv 2 - ip*dv i|(x). 

□ 

We now turn to the smoothing or mollification of maps with values in 
spaces of nonpositive curvature; this generalizes the standard construction 
for functions (’’Friedrichs mollification”). 

We consider any Cg° function 

p : R — » R 


with p{s) > 0 for all s and p{s) = 0 for ,s| > 1, for example 

P{s) ■= | 


exp 5231 for |s| < 1 
0 for \s\ > 1. 
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Given a ball B(x, h) C M in some Riemannian manifold M, with 0 < h < 
injectivity radius of M at a;, we put 


Px,h{y) 



(4.8.23) 


Here, d(x,y) is the distance from x to y £ B(x,h) w.r.t. the Riemannian 
metric of M. 

To simplify the presentation, and in particular to eliminate an additional 
dependence on x, here, we do not work with the Riemannian volume form on 
B(x,h) but rather with the Euclidean one, dz, induced via the exponential 
map exp a , : T X M — > M. Because of the denominator in (4.8.23), 


Px,h(v) d V 

defines a probability measure on B(x, h) (which we may extend by 0 to the 
rest of M). 


Definition 4.8.2 Given a map 

/ : M — > JV, 

N a Riemannian manifold of nonpositive sectional curvature 
its mollification with parameter h ( < injectivity radius of M) is defined by 
fh(x) := center of mass of / w.r.t. the measure 

Px,hi x ) dy on B{x 1 h) 

Thus, fh(x) is the unique minimizer of 

F (P) = \ J d 2 (f(z),p)p x , h (z)dz 

B(x,h ) 

Here, we do not need to assume that N is simply connected because on the 
simply connected ball, we can lift / to a map / : B(x,h ) — > N into the 
universal cover of N, apply the center of mass construction there and project 
back to N. 


Lemma 4.8.9 If f is locally integrable, then fh : M —> N is continuous for 
h > 0. 

Proof. Let X\,X 2 G M; we denote the above measures defined by ph on the 
balls B(xi,h),B(x 2 ,h) by v\ and v 2 , resp. By Lemma 4.8.7 


d(fh(xi), f h (x 2 )) < 


d{f{x) 1 f h {x 2 ))\dv 1 -dv 2 \{x) 
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and the difference measure disi — d goes to 0 if the distance between X\ and 
X 2 goes to 0. □ 

In fact, fh is even smooth for h > 0. To see this, recall that fh{x) as a 
center of mass is characterized by (4.8.15), i.e. 

grad F(f h (x)) = - J expj* {x) (f(z))p Xth (z)dz = 0 

B(x,h) 

Thus, in order to compute the derivative of fh w.r.t. x , by the implicit func- 
tion theorem, we must show that the derivative of grad F{p) w.r.t. p is non 
zero. 

This, however, follows from (4.8.17). 

Theorem 4.8.5 Let f : M — > N be locally integrable. Then, for 0 < h < 
injectivity radius of M, the mollification fh of f is smooth. 

Proof. We have just seen how the first derivative of fh w.r.t. x € M can be 
computed from the implicit function theorem. Because of the smoothness of 
p x ,hi z ) w.r.t. x, higher derivatives then also exist. □ 

Lemma 4.8.10 Let f be continuous at x G M. Then 

lim f h {x) = f(x) (4.8.24) 

h—> 0 

If f is uniformly continuous, then it is the uniform limit of the maps fh for 
h -► 0. 

Proof. Since / is continuous at x, given e > 0, we may find <5 > 0 such that 

f(B(x,S)) C B(f(x),e). 

Because the ball B{f{x),e) is convex, therefore also 

fh(x) C B(f( x),e) 

for 0 < h < 6. This implies (4.8.24). The remaining statement also follows 
from these considerations. 

We close this § with some constructions and results about the asymptotic 
geometry of complete simply connected Riemannian manifolds of nonpositive 
sectional curvature. Let Y be such a manifold for the rest of this §. 

Definition 4.8.1 Two geodesic rays Ci(t),C 2 (t)(t > 0) in Y (i.e. Ci,C 2 : 
[0, oo ) — > Y) parametrized by arc length are called asymptotic if there exists 
k £ M with 


d(c 1 (t),c 2 (t)) < k 
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for all t > 0. This defines an equivalence relation on the space of geodesic 
rays parametrized by arc length, and the set of equivalence classes is denoted 
by Y(oo). (Y(oo) is sometimes called the sphere at infinity of Y.) 

Example. In Euclidean space, two geodesic rays, i.e. straight half lines, are 
equivalent iff they are parallel. 

Lemma 4.8.11 For each pairp GY,xGY (oo), there exists a unique geodesic 
ray c = c px parametrized by arc length in the equivalence class defined by x 
with c(0) = p. 

Proof. Existence: Let cq be a geodesic ray representing x. For n € N, let c n (t) 
be the geodesic arc from p = c„(0) to Co(n), parametrized by arc length as 
usual, t n := d(p, co(n)), i.e. c n (t n ) = c 0 (n), and v n := fjf c n (t ) | t=0 € T p Y the 
tangent vector to c n at p. Since c n is parametrized by arc length, v n has length 
1, hence converges towards some v € T p Y after selecting a subsequence. We 
put 

c(t) := exp p tv , t. > 0. 

Because of the convexity of d 2 (co(t),c n (t)) (Theorem 4.8.2), for 0 < t < t n 
d 2 (c n (t),c 0 (t)) < max(d 2 (c„(0),c 0 (0)),d 2 (c rl (t n ),co(t„))). (4.8.25) 

We have 

t„ = d(c n (0), c n (t n )), since c n ist parametrized by arc length 
< d(c n (0),c o (0)) + d(c 0 (0),c 0 (n)) since c 0 (n) = c n {t n ) 

= d(p, c 0 (0)) + n 

and likewise 

n = d(c o (0),c o (n)) 

< d(co(0), c n (0)) + d(c n (0),c o (?i)) 

= d(co(0),p) + d(c n (0),c n (t n )) 

= d(p,c 0 ( 0)) +t„, 

hence altogether 


d(c 0 (t n ) , c 0 (n)) = | n - t n \ < d(p,c 0 { 0)). 

This implies in conjunction with (4.8.25) for 0 < t < t n 

d(c n (t) , c 0 (t)) < max(d(p, c o (0)),d(co(?i),c o (t rl ))) 
= rf(p,c o (0)). 

For n — > oo, we therefore also get 


d(c(t),c 0 {t)) < d(p,c o (0)). 


(4.8.26) 
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(4.8.26) means that Co and c are asymptotic. This proves the existence of 

Cpx “ C. 

Uniqueness: Let ci,C 2 be rays asymptotic to Co with ci(0) = p = C2(0). 
Then for all t > 0 

d 2 (ci(t), Cz(t)) < const. 

Since d 2 (c\(t), 02 (f)) ist convex in t by Theorem 4.8.2 and vanishes for t = 0, 
it vanishes identically, hence Ci(t) = 02 (f), proving uniqueness. □ 


Lemma 4.8.11 implies that for each p,Y( 00 ) can be identified with the 
unit sphere S p Y := {r> e T p Y : ||u|| = 1} in T p Y. Namely, each unit tangent 
vector uniquely determines an equivalence class of asymptotic geodesic rays. 
It is also not difficult to realize that the topology on Y ( 00 ) defined through 
this identification is independent of the choice of p. We thus obtain a natural 
topology on? = 7U7 ( 00 ) , the so-called cone topology. Y thus becomes a 
compact space. We call v € T p Y,w € T q Y asymptotic if the geodesic rays 
exp p tv, exp p tw(t > 0) are asymptotic. 

Since any isometry of Y maps geodesics onto geodesics and classes of 
asymptotic geodesic rays onto classes of asymptotic geodesic rays, each iso- 
metry of Y induces an operation on Y(oo), hence on Y . too. 


Perspectives. Corollary 4.8.1 goes back to the work of von Mangoldt, Hadamard, 
and E. Cart an. 

The center of mass has been likewise instroduced by E. Cartan. The construc- 
tions and applications presented here are due to Karcher[154]. In fact, Karcher’s 
constructions are more general than presented here and also apply to the case where 
the manifold can have positive curvature. Then, however, one has to work with local 
constructions, and one needs to assume that the measures are supported in some 
convex ball, more precisely in a ball of a radius that is smaller than min (injectivity 
radius, n/2^/H), k > 0 being an upper bound for the sectional curvature. Inspite of 
this restriction, of course the mollifications are quite useful, for example for creating 
or investigating Lipschitz maps. 

More generally, using triangle comparison properties as in this §, one can also 
introduce and investigate metric spaces with any upper and/or lower curvature 
bounds. For a general treatment, we refer to [15]. 

The theory of spaces with lower curvature bounds in the sense of Alexandrov 
has been systematically developed by Yu. Burago, M. Gromov, G. Perel’man[33]. 

Spaces with both upper and lower curvature bounds naturally arise as limits of 
Riemannian manifolds with those same curvature bounds, as will be discussed in 
the following Survey. 

Theorem 4.8.3 is a special case of a result of Y. G. Reshetnyak[205]. The proof 
given here is taken from [137]. 

If A' is a complete, simply connected Riemannian manifold of nonpositive cur- 
vature, then by Theorem 4.8.2 the squared distance between any two geodesics is a 
convex function of the arclength parameter. One may then abstract this property 
and call a complete metric space (Y, d) that is a geodesic length space, i.e. for which 
any two points can be joined by a length minimizing curve - such curves then again 



Exercises for Chapter 4 225 


are called geodesics - a metric space of nonpositive curvature if that convexity prop- 
erty holds. These spaces have been named after Busemann as he was the first to 
systematically investigate this property. A stronger property - which is still satisfied 
by all complete, simply connected Riemannian manifolds of nonpositive curvature 
as shown in Lemma 4.8.3 - is the one introduced by Alexandrov that the distances 
between any two points on a geodesic triangle are always less than or equal to the 
ones in a Euclidean triangle with the same side lengths. In fact, in the Riemannian 
case, both Busemann’s and Alexandrov’s property are equivalent to nonpositive 
sectional curvature. In the context of metric spaces, however, Busemann’s property 
is more general. A reference for these theories is [137]. For applications of these 
concepts, see the Perspectives on § 8.10. 

The compactification Y = Y U y(oo) of a complete simply connected Rieman- 
nian manifold of nonpositive curvature, sometimes called a Hadamard manifold, 
through asymptotic equivalence classes of geodesic rays is due to Eberlein and 
O’ Neill [66]. 

Anticipating some of the Perspectives for Chapter 8, the following monographs 
explore the geometry of nonpositive curvature: [12], [11], [64]. 


Exercises for Chapter 4 

1) Let Mi, M 2 be submanifolds of the Riemannian manifold M. Let the 
curve c : [a, b] —> M satisfy c(a) € M\,c{b) € M 2 . A variation c : 
[a, b] x (— e, e) — > M is called variation of c(t) w.r.t. Mi, M 2 if c(a, s) € 
Mi,c(b,s) € M 2 for all s € (— £,e). 

What are the conditions for c to be an extremal of L or E w.r.t. such 
variations? 

Compute the second variation of E for such an extremal and express 
the boundary terms by the second fundamental forms of Mi and M 2 . 

2) Let M be a submanifold of the Riemannian manifold N, c : [a, 6] — > N 
geodesic with c(a ) € M, c(a) € (T^M)- 1 . For r € (a, 6], c(r) is called 
a focal point of M along c if there exists a nontrivial Jacobi field X 
along c with A(a) € T C ( 0 )M, X(t) = 0. 

Show: 

a: If M has no focal point along c, then for each r £ (a, b), c is 

the unique shortest connection to c(r) when compared with all 
sufficiently close curves with initial point on M. 

b: Beyond a focal point, a geodesic is no longer the shortest con- 

nection to M. 

3) Let S’ 11-1 := {(a; 1 , . . . , x n , 0) € R” +1 , Ex l x l = 1} C S n be the equator 
sphere. Determine all focal points of S'" -1 in S n , and also all focal 
points of S n in K n+1 . 
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4) Let p. q be relatively prime integers. We represent S 3 as 

S 3 = {(z\, z 2 ) € C 2 : \zi\ 2 + \z 2 1 2 = 1}. 

Z q operates on S 3 via 

2nim 27r imp 

(zi,z 2 )^(z 1 e i ,z 2 e « ) (0<m<g — 1). 

Show that this operation is isometric and free. The quotient L(q,p) := 
S 3 /Z q is a so-called lens space. Compute its curvature and diameter. 

5) Show that any compact odd-dimensional Riemannian manifold with 
positive sectional curvature is orientable. (Hint: Use the argument of 
the proof of Synge’s theorem 4.1.2) 

6) Show that the real projective space RP" (cf. exercise 3 of Chapter 
1) is orientable for odd n and nonorientable for even n. (Hint: Use 
Synge’s theorem 4.1.2 and the preceding exercise) 

7) Show that Synge’s theorem does not hold in odd dimensions. (Hint: 
Use the preceding exercise or exercise 4 to give a counterexample) 

8) Try to generalize the theory of Jacobi fields to other variational prob- 
lems. 

9) Here is a more difficult exercise: 

Compute the second variation of volume for a minimal submanifold 
of a Riemannian manifold. 

10) Give examples to show that a curve c(t) = exp p tv as in Corollary 4.2.4 
need not be the shortest connection of its endpoints. (Hint: Consider 
for example a flat torus.) 

11) Let c : [0, oo) — > S n be a geodesic parametrized by arc length. For 
t > 0, compute the dimension of the space of Jacobi fields X 
along c with Jf(0) = 0 = X(t). Use the Morse index theorem 4.3.2 to 
compute the indices and nullities of geodesics on S n . 

12) Show that if under the assumptions of Theorem 4.5.1 we have equality 
in (4.5.6) for some t with 0 < t < r, then the sectional curvature of the 
plane spanned by c(s) and J(s) is equal to p for all s with 0 < s < t. 
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13) Let p € M, n = dim M, r(x) = d(x, p ) , 

1 r 2 (cc). 

w{x,t) := —n exp( — 

fa 4 1 

In the Euclidean case, w(x, t) is fundamental solution of the heat 
operator, i.e. for (x,t) ^ (p, 0) 

(J^ + A)w(x,t) = 0. 

Under the assumptions of Lemma 4.7.1, derive the estimate 

K<^ + A ) w ( x ’t)\ < 2 A2 -^- w (x,t) 

for (x,t) ± (p, 0). 
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We have now covered half of the chapters of the present textbook and the 
more elementary aspects of the subject. Before penetrating into more ad- 
vanced topics, a short survey on some directions of global Riemannian geom- 
etry may be a useful orientation guide. Because of the size and scope of the 
present book, this survey needs to be selective. 

A basic question, formulated in particular by H. Hopf, is to what extent 
the existence of a Riemannian metric with particular curvature properties 
restricts the topology of the underlying differentiable manifold. 

The classical example is the 

Gauss-Bonnet Theorem. Let M be a compact oriented , two-dimensional 
Riemannian manifold with curvature K. Then its Euler characteristic is de- 
termined by 

X(M) = — / K dvol M. 

2ir J 

M 

We have also seen some higher dimensional examples already, namely 
the Theorem 4.1.2 of Synge on manifolds with positive sectional curvature, 
the Theorem 3.5.1 of Bochner and the Bonnet-Myers Theorem (Corollary 
4.3.1) on manifolds of positive Ricci curvature. We have already seen a result 
for nonpositive sectional curvature, namely the Hadamard-Cartan Theorem 
(Corollary 4.8.1) that a simply-connected, complete manifold of nonpositive 
sectional curvature is diffeomorphic to some R ra , and in Chapter 8, we shall 
prove the Preissmann Theorem (Corollary 8.10.2) that any abelian subgroup 
of the fundamental group of a compact manifold of negative sectional curva- 
ture is infinite cyclic, i.e. isomorphic to Z. In order to put these results in a 
better perspective, we want to discuss the known implications of curvature 
properties for the topology more systematically. 

We start with the implications of positive sectional curvature. Here, we 
have the 

Sphere Theorem. Let M be a compact, simply connected Riemannian man- 
ifold whose sectional curvature I\ satisfies 

0 < —k < K < K 
4 “ 
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for some fixed number k. Then M is homeomorphic to the sphere S n (n = 
dimM). 

This was shown by Berger[16] and Klingenberg[157]. 

The pinching number | is optimal in even dimensions > 4, because CP m 
(see § 5.1) is simply connected, has sectional curvature between ) and 1 for 
its Fubini-Study metric and is not homeomorphic to S 2m for to > 1. In odd 
dimensions, the pinching number can be decreased below 1', as shown by 
Abresch and Meyer [2, 3], but the optimal value of the pinching constant is 
unknown at present. 

For n = 2 or 3, the conclusion is valid already if M has positive sectional 
curvature. For n = 2, this follows from the Gauss-Bonnet Theorem. For 
n = 3, Hamilton[109] showed that any simply connected compact manifold of 
positive Ricci curvature is diffeomorphic to S 3 . Hamilton studied the so-called 
Ricci flow, i.e. he considered the evolution problem for a time dependent 
family of metrics gij on M with Ricci curvature R l3 . 

d 2 

—gij(x,t) = —r(t)gij(x,t) -2 R i3 (x,f), 

with initial metric gij(x, 0) = g%{x) where 

/ R(x, t) dvol(g(-, t)) 
f dvol(g(-,t)) 

is the average of the scalar curvature of the metric gij(-,t). He showed that if 
g'-,- is a metric with positive Ricci curvature on a compact 3-manifold, then a 
solution of this evolution problem exists for all time, the Ricci curvature stays 
positive for all t, and as t — > oo, gij(-,t) converges to a metric of constant 
(positive) sectional curvature. 

This method has since become important in Riemannian geometry, al- 
though in general without suitable curvature assumptions on the initial met- 
ric, singularities will develop in finite time, the analysis was carried further in 
[110]. Recently, these singularities have been understood in complete detail 
by Perel’man, with profound implications for the structure and classification 
of 3-manifolds, see [196, 198, 197]. 

It is not known whether M as in the sphere theorem is diffeomorphic 
instead of just homeomorphic to S n . In other words, one has to exclude that 
exotic spheres carry 4 -pinched metrics. This so far has only been achieved 
for certain pinching numbers greater than 4 ; for a sample of results see e.g. 
[121], [207], [123], [235], 

It is not even known whether some exotic spheres can carry a metric of 
positive sectional curvature. Also, the problem of H. Hopf whether S 2 x S 2 
can carry a metric of positive sectional curvature is unsolved. The essential 
question is to understand compact, simply connected Riemannian manifolds 
of positive sectional curvature. Only very few examples of such manifolds are 
known. In fact, besides the general series of compact rank one symmetric 
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spaces (spheres, complex projective spaces (see § 5.1 below) in all even di- 
mensions, quaternionic projective spaces in all dimensions that are multiples 
of 4, and the Cayley projective plane in dimension 16), one only knows the 
family of Allof-Wallach spaces in dimension 7 and the isolated examples of 
Eschenburg and Bazaikin. 

In recent years, however, the first indications of a general structure theory 
seem to emerge, in the work of Petrunin, Tuschmann, Rong, Fang [202], 
[203], [72]. For a comprehensive treatment, see [249]. Essential points of this 
approach are that one studies the more general class of Alexandrov spaces of 
positive curvature which allows to study sequences of positively curved spaces 
and use compactness arguments by the result of Nikolaev quoted below, and 
in particular to utilize collapsing techniques and that the role of the second 
homotopy group becomes more prominent in determining the topological 
possibilities of positively curved spaces. (So, one might speculate that the 
theory of minimal 2-spheres developed in § 8.3 might furnish useful tools for 
understanding the topology of positively curved spaces.) 

We also mention that Wilking[255] showed that in general, a metric of 
positive curvature outside a finite number of points on a compact manifold 
cannot be deformed into a metric of positive curvature everywhere. 

For positive Ricci curvature, we have already exhibited some results. An 
important generalization of these results is Gromov’s 

First Betti Number Theorem. Let M be a compact Riemannian manifold 
of dimension n, with diameter < D and Ricci curvature > A (i.e. ( Rij — 
A gij)i,j is a positive semidefinite tensor). Then the first Betti number satisfies 

h (M) < f(n, A, D) 
with an explicit function f(n,X,D) 

(/( n > 0, D) = n, f(n,\,D) = 0 for A > 0). 


See [98, 101]. 

Finally, it has been determined which simply connected manifolds admit 
metrics of positive scalar curvature and which ones don’t, in the work of 
Schoen and Yau[218], Gromov and Lawson[102] and S. Stolz[232]. 

In the non simply-connected case, also restrictions for positive scalar cur- 
vature are known. For example, for dimension < 7, a torus cannot admit a 
metric of positive scalar curvature, see Schoen and Yau[217]. Such a result 
for any n and other restrictions on metrics of positive scalar curvature were 
given by Gromov and Lawson [103]. 

The preceding results all apply to compact manifolds. For noncompact 
manifolds, let us only quote the 
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Splitting Theorem. The universal covering M of a compact Riemannian 
manifold with nonnegative Ricci curvature splits isometrically as a product 
M = N x R fe , 0 < k < dimM, where N is a compact manifold. 

See Cheeger and Gromoll[43]. 

For a more detailed survey of manifolds of nonnegative curvature, we refer 
to the survey article [95]. 

For manifolds of negative or nonpositive sectional curvature, much more 
is known than for those of positive curvature. Some discussion can be found 
in the Perspectives on 8.10. We also refer to the survey article [65]. 

Lohkamp[170, 171] proved that any differentiable manifold of dimension 
> 3 admits a complete metric of negative Ricci curvature. As a consequence, 
negative Ricci curvature does not imply any topological restrictions. 

Riemannian manifolds of vanishing sectional curvature are called flat. The 
compact ones are classified by the 


Bieberbach Theorem. Let M be a compact flat Riemannian manifold of 
dimension n. Then its fundamental group contains a free abelian normal sub- 
group of rank n and finite index. Thus, M is a finite quotient of a flat torus. 

In analogy to the sphere theorem, one may ask about the structure of 
Riemannian manifolds that are almost flat in the sense that their curvature 
is close to zero. Since the curvature of a Riemannian metric may always be 
made arbitrarily small by rescaling the metric, the appropriate curvature 
condition has to be more carefully formulated in a scaling invariant manner. 
Let us look at the typical example: 

We consider the nilpotent Lie group H of upper triangular matrices with 
l’s on the diagonal. Its Lie algebra is 


tl = {A 


/° 

\o 



aij £ R, 1 < i < j < n}. 


On 1), we may introduce a family of scalar products via 


l|A|| 


2 . 

9 ' 


S a 2 ZJ q 

i<j 


2 (j~i) 


for q > 0. These scalar products induce left invariant Riemannian metrics on 
H whose curvature can be estimated as 


||R q (A,B)q| 9 <24(n-2) 2 ||A||2||B||2||C||2. 

This bound is independent of q. By a g-independent rescaling, we may there- 
fore assume that the sectional curvature satisfies \K\ < 1. We let H(Z) be the 
subgroup of H with integer entries, and one may thus construct left invariant 
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metrics on H which induce on the quotient H/H(Z) metrics with \K\ < 1 
and diam < e, for every e > 0, simply by choosing q sufficiently small. 

Conversely, 

Theorem. For every n, there exists e(n) > 0 with the property that any 
compact n-dimensional Riemannian manifold, M with 

\K\ (diam) 2 < e{n) 

is diffeomorphic to a finite quotient of a nilmanifold. (A nilmanifold is by 
definition a compact homogeneous space of a nilpotent Lie group.) 

This is due to Gromov, see [34] for an exposition, and for the refinement 
that M as above is actually an infranilmanifold by Ruh[206]. 

In order to place this result in a broader context, we introduce the notions 
of convergence and collapse of manifolds. For compact subsets Ai,A 2 of a 
metric space Z, we define 

d^{A u A 2 ) := inf{r : A x C U B(x,r ), A 2 C U B(x,r)} 

x€A 2 xeA! 


where B(x,r) := {y € Z : d(x,y) < r}. 

For compact metric spaces X\,X 2 , their Hausdorff distance is 
d H (X u X 2 ) := inf{ d^{i{X x ),j{X 2 )), 

where i : Xi — > Z,j : X 2 — » Z are 
isometries into a metric space Z } . 

This distance then defines the notion of Hausdorff convergence of compact 
metric spaces. Let Mo be a compact differentiable manifold of dimension n. 
We say that M 0 admits a collapse to a compact metric space X of lower 
(Hausdorff) dimension than M 0 if there exists a sequence (fJj)jeN of Rie- 
mannian metrics with uniformly bounded curvature on Mo such that the 
Riemannian manifolds (Mo, g-j) as metric spaces converge to X. This phe- 
nomenon has been introduced and studied by Cheeger, Gromov, and Fukaya 
[44, 45], [82], 

It is easy to see that any torus can collapse to a point; for this purpose, 
one just rescales a given flat metric by a factor e and lets e — > 0. The diameter 
then shrinks to 0, while the curvature always remains 0. Berger showed that 
S 3 admits a collapse onto S 2 . The construction is based on the Hopf fibration 
n : S 3 —> S 2 = CP 1 (see § 5.1), and one lets the fibers shrink to zero in length. 

In this terminology the above theorem (as refined by Ruh) says that 
those manifolds that can collapse to a point are precisely the infranilmani- 
folds. More recently, it was shown by Tuschmann[248] that any manifold that 
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admits a collapse onto some flat orbifold is homeomorphic to an infrasolv- 
manifold and conversely, that any infrasolvmanifold also admits a sequence 
of Riemannian metrics for which it collapses to a compact flat orbifold. Here, 
an infrasolvmanifold is a certain type of quotient of a solvable Lie group. 

We next mention the following result of Cheeger[41], with the improve- 
ments by Peters [199]. 

Finiteness Theorem. For any n £ N, A < oo, D < oo,v > 0, the class 
of compact differentiable manifolds of dimension n admitting a Riemannian 
metric with 

\K\ < A, diam < D , Volume > v 
consists of at most finitely many diffeomorphism types. 

The lower positive uniform bound on volume prevents collapsing and is 
necessary for this result to hold. 

Diffeomorphism finiteness can however actually also be obtained if no 
volume bounds are present and collapsing may take place. 

This is demonstrated by the following recent finiteness theorem by Petrunin 
and Tuschmann[203]. Instead of volume bounds this result only uses a merely 
topological condition: 

7T2-Finiteness Theorem. For any n € N, A < oo, and D < oo, the class of 
compact simply connected differentiable manifolds of dimension n with finite 
second homotopy group admitting a Riemannian metric with 

\K\ > A, diam < D 

consists of at most finitely many diffeomorphism types. 

Cheeger’s finiteness theorem was refined in the so-called Gromov conver- 
gence theorem, which we are going to present in the form proved by Pe- 
ters [200] and Greene and Wu[93]. 

Convergence Theorem. Let (Mj. gj )jen be a sequence of Riemannian man- 
ifolds of dimension n satisfying the assumptions of the finiteness theorem 
with A , D, v independent of j. Then a subsequence converges in the Haus- 
dorff distance and (after applying suitable diffeomorphisms) also in the (much 
stronger) C 1,a topology (for any 0 < a < 1) to a differentiable manifold with 
a C 1,a -metric. 

Such a family of manifolds is known to have a uniform lower bound on 
their injectivity radius. The crucial ingredient in the proof then are the a- 
priori estimates of Jost-Karcher for harmonic coordinates described in the 
Perspectives on 8.10. Namely, these estimates imply convergence of subse- 
quences of local coordinates on balls of fixed size, and the limits of these 
coordinates then are coordinates for the limiting manifold. 
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Nikolaev[193] showed that the Hausdorff limits of sequences of compact 
n-dimensional Riemannian manifolds of uniformly bounded curvature and 
diameter and with volume bounded away from 0 uniformly are precisely the 
smooth compact n-manifolds with metrics of bounded curvature in the sense 
of Alexandrov. 

Let us conclude this short survey by listing some other textbooks on 
Riemannian geometry that treat various selected topics of global differential 
geometry and which complement the present book, Chavel[40], Cheeger and 
Ebin[42], do Carmo [60], Gallot, Hulin and Lafontaine[83], Gromoll, Klingen- 
berg and Meyer[96], Klingenberg[159], Petersen [201], Sakai[211[. 

Finally, we wish to mention the stimulating survey Berger[17]. 



5. Symmetric Spaces and Kahler Manifolds 


5.1 Complex Projective Space 


We consider the complex vector space C" +1 . A complex linear subspace of 
C" +1 of complex dimension one is called a line. We define the complex pro- 
jective space CP" as the space of all lines in C" +1 . Thus, CP" is the quotient 
of C" +1 \{0} by the equivalence relation 


Z ~ W : ^ 3X G C\{0} : W = XZ. 


Namely, two points of C" +1 \{0} are equivalent iff they are complex linearly 
dependent, i.e. lie on the same line. The equivalence class of Z is denoted by 
[Z] . 

We also write 

z= (z°,...,z n ) e c n+1 

and define 

Ui := {[Z] : Z i yf 0} C CP", 

i.e. the space of all lines not contained in the complex hyperplane {Z l = 0}. 
We then obtain a bijection 

<Pi ■■ Ui - C" 


via 


<Pi([Z°,...,Z n ]) 



CP" thus becomes a differentiable manifold, because the transition maps 


(fij o ip. 1 : ipi{Ui n Uj) = {z = (z 1 , . . . , z n ) G C" : z° ^ 0} -> <Pj(Ui n Uj) 
Vr-d") = 1, Z l+1 , . . . , Z"]) 

fz 1 z i z i+1 z j - x Z j+1 
~ zi ’ zi 

(w.l.o.g. i < j ) 


are diffeomorphisms. They are even holomorphic; namely, with z k = x k + iy k 
( i = \/U 1) and 
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d 

1 , 

(A. 

. d 

~dz * “ 

" 2 ' 

\dx k 

1 dy k 

d 

1 , 

( _d_ 

. d 

1 n 

dz * " 

" 2 ' 

\dx k 

dy k 


we have 

° vAi* 1 , ■■■,*") = 0 for k = 1, . . . , n. 

Thus, CP" is a complex manifold in the sense of Definition 1.1.5. 

We consider the (n + l)-tuple 

(Z°,...,Z n ) 

(satisfying the restriction that not all Z J vanish identically) 

as homogeneous coordinates [Z] = [Z°,...,Z n ]. These are not coordinates 
in the usual sense, because a point in a manifold of dimension n here is 
described by (n + 1) complex numbers. The coordinates are defined only up 
to multiplication with an arbitrary nonvanishing complex number A 

[Z 0 ,...,Z n ] = [AZ 0 ,...,AZ n ]; 

this fact is expressed by the adjective “homogeneous”. The coordinates 
(z 1 , . . . , z n ) defined by the charts pi are called Euclidean coordinates. The 
vector space structure of C" +1 induces an analogous structure on CP" by 
homogenization: Each linear inclusion C m+1 C C" +1 induces an inclusion 
CP” 1 C CP". The image of such an inclusion is called a linear subspace. The 
image of a hyperplane in C" +1 is again called a hyperplane , and the image of 
a twodimensional space C 2 is called a line. 

Instead of considering CP" as a quotient of C" +1 \{0}, we may also view 
it as a compactification of C". One says that the hyperplane H at infinity is 
added to C"; this means the following: The inclusion 

C " -> CP" 


is given by 

(z 1 ,...,z n ) i ^ [l,z 1 ,...,z n ]. 

Then 

CP"\C" = {[Z] = [0, Z\ . . . , Z n ]} =: H, 

and H is a hyperplane CP" -1 . 

It follows that 


CP" = C" U CP" -1 = C" U C" -1 U...UC 0 


(5.1.1) 
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(disjoint union). Topologically, CP” thus is the union of (n + 1) cells of real 
dimension 0,2,..., 2 n. With the help of the Mayer-Vietoris sequence of coho- 
mology theory 4 , we may easily compute the cohomology of CP” from (5.1.1). 
In order to represent CP” as the union of two open sets as required for the 
application of this sequence, we put 

U := C", V := {z G C" : ||z|| = z j z 3 > 1} U CP”" 1 (as in (5.1.1)). 

Then V has CP”" 1 as a deformation retract (consider 

r t : V — > V, r t (z) = tz for z G C”, r t (w) = w for w G CP”" 1 

and let t run from 1 to oo), and U fl V is homotopically equivalent to the 
unit sphere S' 2 "" 1 of C”. 

We now observe first that CP 1 is diffeomorphic to S 2 . It actually follows 
already from (5.1.1) that the two spaces are homeomorphic. In order to see 
that they are diffeomorphic, we recall that S 2 may be described via stereo- 
graphic projection from the north and south pole by two charts with image 
C and transition map 

1 

Z - 
Z 

(cf. 1.1). This, however, is nothing but the transition map 

[M] [“'.>!] 

z 

of CP 1 . 

In particular, if 0 (CP 1 ) = if 2 (CP 1 ) = K, I/ 1 (CP 1 ) = 0. For the general 
case, the relevant portion of the Mayer-Vietoris sequence is 

H q ~ i (S 2 ”" 1 ) -> H q ( CP") -> H q { C”) © iJ^CP”" 1 ) -> I7 <? (5' 2 ”" 1 ). (5.1.2) 

We now want to show by induction w.r.t. n that 

H q (CP") = / ® ^ or 5 = 2, . . . , 2 n 

1 0 otherwise 

This is obvious for q = 0. For 2 < q < 2n — 1 we have H q ^ 1 (S 2n ~ 1 ) = 
0,H q (C n ) = 0, and for q = 2, . . . , 2n - 2 we obtain from (5.1.2) that 
IT? (CP") = K since by inductive assumption U^CP"" 1 ) = K, while for 
q = 1, 3, . . . , 2n — 1, again by inductive assumption, IF^CP"" 1 ) = 0, hence 
also H q { CP") = 0. The case q = 1 is similar. U 2 ”(CP") = M again follows 
from (5.1.2) or even more easily from Corollary 2.2.2. 

Let us also show that CP" can be considered as a quotient of the unit 
sphere S 2n+1 in C” +1 . Namely, each line in C” +1 intersects 5' 2 " +1 in a circle 


4 This sequence has been derived in the previous editions of this textbook, but for 
the present edition, we are not including an introduction to cohomology theory 
anymore as that can be readily found in standard textbooks on algebraic topology. 
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S 1 , and we obtain the point of CP" defined by this line by identifying all 
points on that circle. 

The projection 

7T : S 2n+1 -4 CP" 

is called the Hopf map. In particular, since CP 1 = S 2 , we obtain a map 

t t:S 3 ->S 2 


with fiber S 1 . 

The unitary group U(n + 1) operates on C" +1 and transforms complex 
subspaces into complex subspaces, in particular lines into lines. Therefore, 
U(n + 1) also operates on CP". 

We now want to introduce a metric on CP". For this purpose, let 
7T : C" +1 \{0} -4 CP" 

be the standard projection, U C CP", Z : U —> C" +1 \{0} a lift of 7 r, i.e. a 
holomorphic map with 7r o Z = id. We put 


u;:= l -ddlog\\Z\\ 2 , 


putting for abbreviation 


9:= dZ j ,B:= — T dZ ~ k . 
dZ:> ’ dZ k 

If Z' : U —4 C" +1 \{0} is another lift, we have 

Z' = pZ, 


(5.1.3) 


where tp is a nowhere vanishing holomorphic function. 

Hence 

log Ill'll 2 = l -dd(\og\\Z\\ 2 + \ogip + \og<p) 

= u> + ^(93 log ip — 99 log (p) (cf. (5.1.10) below) 

= w, 


since 91og^7 = 0 = 9 log <p, because tp is holomorphic and nowhere vanishing. 
Therefore, to does not depend on the choice of chart and thus defines a 2-form 
on CP". 

We want to represent u> in local coordinates; for this purpose, let as above 
U 0 = {[Z°,...,Z n ]:Z°^0}, 

since z l = on Uq, Z = (1, z 1 , , z n ) is a lift of 7r over Uq. Then 



5.1 Complex Projective Space 241 


ui = |<931og(l + z 3 z 3 ) 

_ i_ d ( z J dz J \ 

2 y 1 + z k z k J 

hence 

_ i f dz j A dz j z 3 z k dz 3 A dz k 
2 \ 1 + z k z k (1 + z e z e ) 2 

At [1,0, ... ,0] again 

to = A dz 3 = dx 3 A dy 3 . 


(5.1.4) 


(5.1.5) 


Thus, to is positive definite (in a sense to be made precise in Definition 5.1.1) 
at the point [1,0, . . . , 0] . Since w is invariant under the operation of U(n + 1) 
on CP”, it is therefore positive definite everywhere. 

We want to generalize the object to just introduced in the following 


Definition 5.1.1 Let M be a complex manifold with local coordinates z = 
(z 1 , . . . , z n ). A Hermitian metric on M is given by an expression of the form 

hj k (z)dz 3 ® dz k 

where hj k (z) depends smoothly (i.e. C°°) on z and is positive definite and 
Hermitian for every z. 

The expression 

k(z)dz 3 A dz k 

is called the Kahler form of the Hermitian metric. 


That hji . is Hermitian means 



^ kj hjk’ 

(5.1.6) 

We also put 

Let now 

h kj hjki hjk 0 hj k . 

d - d d - d 

V = V 3 — — r + V 3 t ,w = w 3 — — + w J — - 

dz 3 dz 3 dz 3 dz 3 

(5.1.7) 


be tangent vectors (with complex coefficients) in z £ M. 
We put 

(v,w) := hj k (z)v 3 w k + h k j(z)v 3 w k . 


If v and w are tangent vectors with real coefficients, i.e. 


iJL 

dx 3 


J+ n - 


dy j 


dx 3 


w 


i j+n _ 


dyi 


(5.1.8) 
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with v a ,w a € R, a = 1, . . . , 2 n, then because of 

d d d d . ( d d \ 

dxi dzi dzi ’ dyi \ dzi dzi ) 

we have 

(v, w) = h fk {v j + iv j+n )(w k - iw k+n ) 

+ h^{v j - iv j+n )(w k + iw k+n ) 

= 2 Re h fk {v j w k + v j+n w k+n ) 

+ 21m h jk (v j w k+n - w k v j+n ). 

Consequently, each Hermitian metric induces aRiemannian one. This justifies 
the name “Hermitian metric”. 

Definition 5.1.2 A Hermitian metric h^ k dz^ ®dz k is called a Kahler metric, 
if for every 2 there exists a neighborhood U of z and a function F : U — > R. 
with | hj k dz i A dz k = ddF. ddF then is called the Kahler form. 


The 2-form u> from (5.1.3) defines a Kahler metric on CP", called the 
Fubini-Study metric. 

This metric has many special properties. In particular, the operation of 
U(n+1) on C" +1 induces an isometric operation of U(n+1) on CP" equipped 
with this metric. This follows from (5.1.5) and the fact that || • || is invariant 
under the operation of SO(2n + 2), hence in particular invariant under the 
one of U(n + 1). 

For a line L in C" +1 we may also consider the reflection at L, i.e. 


s\l = id, 

S|ZA = - id 

s then induces an isometry a of CP" (equipped with the Fubini-Study metric) 
with fixed point 7 r(L) and 


In particular 


da = - id : T„ {l) CP" - T n{L) CP". 
a 2 = id. 


Definition 5.1.3 A Riemannian manifold is called symmetric if for every 
p £ M there exists an isometry a. p : M M with 

Vp(p) =P, 

Da p (p) = — id (as a self map of T p M) 


Such an isometry is also called an involution. 
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Thus, CP", equipped with the Fubini-Study metric, is a symmetric space. 

Thus, complex projective space carries two different structures: it is both 
a Kahler manifold and a symmetric space. The rest of this chapter is devoted 
to an investigation of those structures. 


5.2 Kahler Manifolds 


In the preceding section, we have introduced complex projective space as 
an example of a Kahler manifold. There exist simpler examples. Namely, C d 
with its standard Euclidean metric is a Kahler manifold with Kahler form 

= -cfo 7 ^ ■ 

Also, any complex 1-dimensional manifold S, that is, any Riemann surface 
(see 8.2) is automatically a Kahler manifold since du> is a 3-form and therefore 
vanishes on the real 2-dimensional manifold S. 

Moreover, any complex submanifold N of a Kahler manifold M is auto- 
matically a Kahler manifold itself; we simply need to restrict the local Kahler 
potential F of M to N. Therefore, in particular, all complex projective man- 
ifolds, that is, those that admit a holomorphic embedding into some complex 
projective space, are Kahler manifolds. This makes Kahler geometry a useful 
tool in algebraic geometry. 

In this section, we want to give a systematic introduction to Kahler ge- 
ometry. We start by recalling the rules from Lemma 1.5.4 for the calculus of 
the operators 3 and 3: 


d = 3 + 3, 

(5.2.1) 

33 = 0, 33 = 0, 

(5.2.2) 

33 = — 3d . 

(5.2.3) 

We can now state various equivalent versions of the Kahler condition 

w := h fk dz j A dz~ k = 33F, 

(5.2.4) 


that is, that for every z, there exist some neighborhood U and some function 
F defined on U with this property. 

Theorem 5.2.1 The following conditions are equivalent to a Hermitian man- 
ifold M being Kahler. 
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dio = 0, 

i.e. the Kahler form lo is closed. 

(ii) In local (holomorphic) coordinates 


or equivalently , 


dhjj 

8z k 

dha 


dh 


kj 


dz 


dh, 


for all i,j, k 


for all i,j,£. 


(5.2.5) 


(5.2.6) 


(5.2.7) 


Bz e dzt 

(in) At each zq £ M, holomorphic normal coordinates can be introduced, 


hij{zo) = Sij, -^{z Q ) = 0 = for alii, j,k,£. (5.2.8) 

In other words, we can find holomorphic coordinates near any zq, 
which we then take the liberty to identify with 0, so that for z near 0 


h ii(z) = S i:j +0(\z\ 2 ) 


(5.2.9). 


The last condition expresses the essential content of the Kahler condition, 
namely the compatibility of the Riemannian and the complex structure. Con- 
dition (i) has the advantage of expressing the Kahler condition in a global, 
coordinate invariant manner. This will make it particularly useful. 

Proof. We first show that the Kahler condition implies (i). 

d(ddF ) = (d + B)(dBF) = dddF - dddF = 0 by (5.2.2), (5.2.3). 

This yields (i). (ii) is the local coordinate version of (i). In turn, (i) implies 
the Kahler condition by the Frobenius theorem. Namely, since lo is closed, 
duj = 0, on each sufficiently small open set U, we can find a 1-form p with 
dp = lo. lo is a (l,l)-form, and so, when we decompose the 1-form 77 into a 
(1,0)- and a (0,l)-form, 77 = ry 1,0 + 77 0,1 , we have 

lo = dp = (d + d)r] = dp 0,1 + Bp 1 ’ 0 

with 

Bp 0 ’ 1 = 0 = Bp 1 ' 0 . 

From the last condition, on our sufficiently small U, we can then find s func- 
tions a and (3 with 

p 0 ' 1 = Ba, p 1 ' 0 = — d/3 , 
and so, keeping (5.2.3) in mind, 
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to = dd{a + (3). 

Since w is real (a> = to), we may then also assume that the function F := a+/3 
is real, and we have deduced the Kahler condition from (i). It thus only 
remains to show that (iii) is equivalent to the other conditions. It is clear 
that (5.2.9) implies dto(zo) = 0, that is, (i). For the converse, we first achieve 
by a linear change of coordinates that h i j(zo) = Sij. Thus, 

u> = — hjj.dz J A dz k = ~(6jk + aj k iz l + a^iZ^dz’ A dz k . 

Here, (5.1.6) implies that 

o-kj! = djki (5.2.10) 

and (i) yields 

&jkl = ®ikj • (5.2.11) 

We shall now make the linear terms disappear by the following change of 
coordinates 

^ = C J - \a m CY- (5.2.12) 

This yields 

w =\W - a lok C, k dC!) A (dC J - d njm rdC) 

+ ^ ( a jfczC Z + a jklC'')d C A d(, k + 0(\z\ 2 ) 

= Z -6 jk d(i Ad( k + 0(\z\ 2 ), 

using (5.2.10), (5.2.11). This is (5.2.9). □ 

In particular, the Kahler form u>, being closed, represents a (complex) 

cohomology class, i.e. an element of H 2 (M) ® C. 

Lemma 5.2.1 The Kahler form /i of a Kahler metric on a complex manifold 
M with dime M = n satisfies 

p n = n\*(l). (5.2.13) 


Proof. (5.2.13) is a pointwise identity. Let p € M. Since a Hermitian form 
can be diagonalized by a unitary transformation, we may assume that local 
coordinates are chosen such that at p 


H = -dz-’ A dz ’ = dx ’ A dy’ . 


Therefore 
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y n = n\dx 1 A dy 1 A dx 2 A dy 2 A ... A dx n A dy n 
= n! * (1), 

since dx 1 , dy 1 , dx 2 , dy 2 , . . . , dx n . dy n constitute a positive orthonormal basis 
of T*M. □ 

Corollary 5.2.1 The Kahler form of a Kahler metric on a compact manifold 
represents a nontrivial cohomology class, and so does every yl , j = 1 ,n. 
Therefore, the cohomology groups H 2 (M), H 4 (M), . . . , H 2n (M) of a compact 
Kahler manifold are nontrivial. 

Proof. By Lemma 5.2.1 

J y n = n! J *(1) = n!vol(M) > 0. 

M M 

If we now had y J = dip for some j £ {1, . . . , n}, then we would also have 



y j A y n ~ j 


j dip A y n ~ j 


J d{ip A y n ~ j 


), 


MM M M 

since y is closed by Theorem 5.2.1 
= 0 by Stokes theorem. 


This is a contradiction. 


□ 


Corollary 5.2.1 expresses an instance of the important fact that the exis- 
tence of a Kahler metric yields nontrivial topological restrictions for a mani- 
fold. We shall soon derive some deeper such results. Before doing that, how- 
ever, we state some useful local formulae in Kahler geometry. 

For the inverse of the Hermitian metric ( h f 3 ) , we use the convention 


hf j h kl = 6 ik (5.2.14) 

(note the switch of indices). With h := det(/i.^), the Laplace-Beltrami oper- 
ator (2.1.12) becomes 



This is most easily seen by using the coordinates given in (iii) of Theorem 
5.2.1 and then observing that both expressions transform in the right manner 
under coordinate transformations. 

Similarly, we have for the Christoffel symbols of a Kahler manifold 



(5.2.16) 
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because of (5.1.6), (5.2.6), (5.2.7). All other Christoffel symbols, that is, all 
those that contain both bared and unbared indices, vanish. Using this, the 
formula (3.1.31), (3.3.6) for the Riemannian curvature tensor also simplifies 
to become 


d 2 

r> u _ _ umn 

ijU " 


{dzJ hrn] 


Also, 


kit dd ijki 


With the first Bianchi identity (3.3.8), and 


we then obtain 
and analogously 


^ijkl Rilkj 
‘*kjik R ijki ’ 


(5.2.17) 

(5.2.18) 

(5.2.19) 

(5.2.20) 

(5.2.21) 


The Ricci tensor (3.3.18) of a Kahler metric is given by 

R kI =hf*R m . (5.2.22) 

From (5.2.17), we then have a simple formula for the so-called Ricci form 

R k ldz k A dz * = — 391ogdet (h^). (5.2.23) 


Finally, the scalar curvature of a Kahler metric is 


R = Z\logdet(/ijj). (5.2.24) 

The Ricci form is closed by (5.2.1)-(5.2.3) and therefore defines a cohomology 
class, the so-called first Chern class 


d (M) := R kl dz k A dz 1 (5.2.25) 

Z7T 

which is independent of the choice of Kahler metric. Namely, if hR is another 
Kahler metric on M with Ricci class R' k ^dz k A dz e = — <991ogdet(/iA), then 

{R kl - R' kl )dz k A dz 1 = -dd\og (5.2.26) 

and this is exact since is a globally defined function independent of the 

choice of coordinates (this follows from the transformation formula (1.4.3)). 

We recall from the end of 1.5 that on a complex manifold, the space of 
(complex- valued) fc-forms f2 k (M) admits a decomposition 
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Q k (M)= f2 p,q (M). (5.2.27) 

p+q=k 

The elements of f2 p,q are called (p, < 7 )-forms. f2 p,q is generated by forms of 
the type 

p{z)dz ix A ... A dz ^ A dz n A ... A dz 3 «. (5.2.28) 

We now use the Kahler form ui to define 

L : O p ' q -> f2 p+1 ’ q+ \ L (rj) := rj A u (5.2.29) 

and its adjoint w.r.t. the L 2 -product 

= / rj A *<r (5.2.30) 

Jm 

(where the star-operator * introduced in 2.1 has been linearly extended from 
the real to the complex case), 

A := L* : f2 p ’ q -> QP- 1 ^- 1 . 

For example, for p = ri^dz'' A dz k , we have, recalling u> = J 2 h.ji.dz q A dz k , 

A(v) = -2 ih fk rjf k (5.2.31) 

Theorem 5.2.2 On a Kahler manifold, we have the identities 

[A, d] = -id* (5.2.32) 

[A,d\=id*. (5.2.33) 

{[A,B] = AB — BA). 


Proof. Since A is a real operator because u> is real, each of these two identi- 
ties implies the other by conjugation. We shall now verify (5.2.33). For this, 
we shall use the Kahler condition in an essential way. Namely, A being the 
adjoint of the multiplication with the Kahler form to, its operation involves 
the Hermitian metric hfj, but no derivatives of it, see e.g. (5.2.31). Thus, 
the commutator of A with the first derivative operator d involves at most 
first derivatives of the Hermitian metric. By (iii) of Theorem 5.2.1, we may 
assume that these first derivatives vanish at the point under consideration. 
Therefore, we can neglect them and compute as on Euclidean space. Thus, we 
only need to verify (5.2.33) on C d , and we proceed to do so. In fact, most of 
the relevant formalism has been developed already in 1.8 and 2.1; we briefly 
recall it here. We have the L 2 -product of k - forms 

(«,/?)= / a A *0. (5.2.34) 

J c d 

To see the pattern, we check that in the case d = 1 
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*dz = *(dx + idy) = dy — idx = — idz and 
*dz = *(dx — idy) = dy + idx = idz , 


as well as 

*(1) = dx A dy = -dz A dz. (5.2.35) 

We let Cj be the exterior product with 4=dz/ 


e-jcx := —^dz-’ A a, 

J y/2 

and similarly 

e^a := — = dz 3 A a. 

1 a/2 

The factor here is inserted because the Euclidean norm of dz J = dx^ +idyi 
is Thus, the L 2 -adjoint ij of Cj is given by contraction with -^dz- 7 , that 
is 


Lj(dz jl A ... A dz jp A dz ^ 1 A ... A dz^) 

fO if j i 

~ \ (-1 ) /J -- 1 V2dz jl A ... A dz^ A ... A dz> A dz ?1 A ... A dz*-> if j = j M . 
We check this in a simple case - the general pattern will then be clear: 


(ei dz 1 ,dz 1 A dz 1 ) = (—= dz 1 A dz 1 , dz 1 A dz 1 ) 
a/2 

= v^dz 1 , dz 1 ) 

= (dz 1 , ti(dz 1 A dz 1 )). 

Next, we either recall (1.8.19), (1.8.20) (where, however, a somewhat different 
notation had been employed) or check directly that 


fob) ~b — f 

ejLc + Htj = 0 for j / £ 

6j Li + = 0 for allj, £. 

Putting dj := ^ and c/ := -J—*, we then have 

9 = ' - = \ 2 )//,(•/ 
j i 

a = 

j j 

9 = —\/2^^djij 

j 


(5.2.36) 

(5.2.37) 

(5.2.38) 
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3 

L = i Y. e :i^ 

3 

A =- i Y. L 3 i i- 

3 

Equipped with these formulae, it is now straightforward to complete the 
proof: 


Ad = —iV 2 iiL(djej 


3,1 


= — i V 2 dj iiLi €j 

3,1 

= ~i^2 ( fyjhtj + E d 3 L i^ e J 



J 

L J + E ® 3 L 3 

E 

3 


+ E 9 3 L 3 + 

E d 3 

3 


dj lj 



j i-t 


j I't 


j,t 3 

= dA + id* . 

Thus, we have shown the identity on C d , and the Kahler condition then makes 
this also valid on a general Kahler manifold, as explained. □ 


In addition to the Laplacian 

A = dd* + d*d, 

we can also build the operators 

A d := dd* + d*d and 
A§ ■= dd* + d*d. 

Theorem 5.2.3 On a Kahler manifold, 

A = 2 A a = 2 A s . 


(5.2.39) 

(5.2.40) 

(5.2.41) 

(5.2.42) 


Proof. From Theorem 5.2.2 
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A d = i(d[A,d\ + [A,B\d) 

= i(dAd — ddA + Add — dAd) 

= i(dAd + ddA - Add - dAd) by (5.2.3) 

= -i{d[A,d\ + [A,d]d) 

= A d . (5.2.43) 

Next, 

dd* + d*d = -i(d(Ad - dA) + (Ad - dA)d) = 0, (5.2.44) 

by Theorem 5.2.2 and (5.2.2). Finally, from (5.2.44) and (5.2.1), we easily get 

A = A d + Ag. (5.2.45) 

The relations (5.2.43) and (5.2.45) yield (5.2.42). □ 


In 2.2, we had defined the cohomology groups H k (M) and identified them 
with spaces of harmonic forms, that is, solutions of 


Ai y = 0, 


(5.2.46) 


see Theorem 2.2.1. From Theorem 5.2.3, we infer that the operator A pre- 
serves the decomposition (1.5.7) which in fact is orthogonal w.r.t. the L 2 - 
product, 

Q k (M) = 0 H p ’ q (M ), (5.2.47) 

p+g=fc 


that is, 


Z\ : (2 p ' q (M) -► a p ' q (M). 


(5.2.48) 


If we then define H p,q (M ) := H k (M) D fi p,q (M ) (p + q = k) as the space 
of harmonic forms of bidegree ( p,q ), we obtain the first part of the Hodge 
decomposition theorem, while the second part follows from the fact that 
A is a real operator and therefore, complex conjugation maps harmonic forms 
to harmonic forms: 


Corollary 5.2.2 For a compact Kahler manifold M, 

H k (M, C) = 0 H p ’ q (M, C) (5.2.49) 

p+q=k 

H p,q (M, C) = H q ’P(M, C) (complex conjugate). (5.2.50) 


The fctli Betti number of the compact manifold M (see Definition 2.2.1) 
is given by 

b k (M) = dim c H k (M,C), (5.2.51) 


and if we put 
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h p ’ q (M) = dim c £P> 9 (M,C), 

we obtain 

Corollary 5.2.3 For a compact Kahler manifold M, 

b k (M)= hP ’ q ( M ) 

p+q—k 


and 

and consequently 


h q,p (M) = h p,q (M), 
bk(M) is even for odd k. 


(5.2.52) 


(5.2.53) 


(5.2.54) 

(5.2.55) 


We have already seen a restriction on the topology of a compact Kahler 
manifold in Corollary 5.2.1. (5.2.55) is a deeper such restriction. 


Perspectives. Kahler geometry started with the remarkable paper of Kahler[151] 
that introduced the Kahler condition and derived all the basic formulae and the 
perspectives for the subsequent development of the subject. A thorough discussion 
of Kahler’s paper can be found in [27] and [153] . 

Some references that we have used in the present section are[251], [94] and [129]. 

Let us briefly mention some further aspects of Kahler geometry. Metrics on 
Kahler manifolds satisfying 

Rij = phij for some constant p 
are called Kahler- Einstein metrics. 

Since the Ricci form represents a cohomology class ci(M), there are necessary 
conditions for the existence of a Kahler-Einstein metric with positive, negative 
or vanishing p. Namely, ci(M) has to be representable by a positive or negative 
cohomology class, or has to be cohomologous to 0, resp. For nonpositive p, these 
conditions were also shown to be sufficient for the existence of a Kahler-Einstein 
metric on a compact M in famous work of S.T. Yau[263] (the case of negative p 
was also independently solved by Aubin, see the account in [9]). 

The case of positive p is not yet completely solved. In that case, there exist 
obstructions for the existence of Kahler-Einstein metrics. Existence results in cases 
where these obstructions vanish were obtained by Tian[245], Tian and Yau[247], 
Siu[226], Nadel[191]. Yau, Problem 65 in [264], conjecturally related the existence of 
a Kahler-Einstein metric to stability properties in the sense of algebraic geometry 
of the underlying manifold. Tian[246] developed the appropriate stability notion 
and showed its necessity for the existence of a Kahler-Einstein metric. He thus 
disproved the conjecture that a compact Kahler manifold with positive Chern class 
always admits a Kahler-Einstein metric if it has no nontrivial holomorphic vector 
field (another condition that is known to be necessary). 

As noted, every complex manifold with dime M = 1, i.e. every Riemann surface 
(see Definition 8.2.1), is Kahler since condition (i) above is trivially satisfied for any 
Hermitian metric. Moreover, in that case, the Kahler-Einstein metrics are simply 
the ones of constant curvature, and by the uniformization theorem, every Riemann 
surface admits such a metric since its universal cover (C, S 2 or the hyperbolic 
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upper half-plane H = {z = x + iy,y > 0}) does; in the latter case, the metric is 
p-(dx 2 + dy 2 ), see also 4.4. Moreover, the metric is unique up to isometries. 

If one studies the space of all compact Riemann surfaces of a given topological 
type (Teichmiiller theory), it is then convenient to investigate the space of all metrics 
of constant curvature on a given differentiable surface, because one can exploit 
additional geometric information. In a similar vein, the aforementioned results of 
S.T. Yau have found important applications in the classification of Kahler manifolds 
and algebraic varieties. 

A certain class of Kahler manifolds, the so-called special Kahler manifolds (see 
[78]), has become important in string theory. 


5.3 The Geometry of Symmetric Spaces 

Besides CP", we have already seen other examples of symmetric spaces: 

• R d , equipped with the Euclidean metric, i.e. d-dimensional Euclidean 
space E d . The involution at p £ E d is the map a p (x ) = 2p — x. 

• the sphere S d : Since its isometry group operates transitively on S d , 
it suffices to display an involution cr at the north pole (1,0, ...,0); 
such an involution is given by 

ct(x\ . . . , x d+1 ) = (a: 1 , -x 2 , . . . , -x d+1 ) 
in the usual coordinates. 

• hyperbolic space H d from 4.4. Again, the isometry group operates 
transitively, and it suffices to consider the point (1,0, . . . ,0) (in the 
notations from 4.4), the isometry here is 

cr(x°, . . . , x d ) = (x°, —x 1 , . . . , — x d ). 

In the sequel, V will always denote the Levi-Civita connection. 

Lemma 5.3.1 An involution cr p : M — > M of a symmetric space reverses the 
geodesics through p. Thus, if c : (— e,e) — » M is geodesic with c(0) = p (as 
always parametrized proportionally to arc length), then a p c(t) = c(—t). 

Proof. As an isometry, a p maps geodesics to geodesics. If c is a geodesic 
through p (with c(0) = p) , then 

D<j p c(fS) = — c(0). 

The claim follows since a geodesic is uniquely determined by its initial point 
and initial direction (cf. Theorem 1.4.2). □ 

Lemma 5.3.2 Let c he a geodesic in the symmetric space M, c(0) = p, c(r) = 
q. Then 
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a q a p (c(t)) = c(t + 2 r) (5.3.1) 

(for all t , for which c(t) and c(t + 2 r) are defined). For v £ T c i t )M, 
Da q Da p (v) € T c ( t + 2 T)M is the vector at c(f + 2r) obtained by parallel trans- 
port of v along c. 

Proof. Let c(t ) := c(t + r). c then is geodesic with c(0) = q. It follows that 

a q <7 p (c(t)) = <j q (c(—t)) by Lemma 5.3.1 
= v q {c{~t - t )) 

= c(t + t) 

= c(t + 2 r). 

Let v £ T p M and let V be the parallel vector field along c with V(p) = v. 
Since a p is an isometry, Dcr p V is likewise parallel. Moreover, Da p V(p ) = 
—V(jp). Hence 

Da p V(c(t )) = 

Da q o Da p V ( c{t )) = V ( c(t + 2r)) as before. □ 

Corollary 5.3.1 A symmetric space is geodesically complete, i.e. each geodesic 
can be indefinitely extended in both directions, i.e. may be defined on all of 

R. 


Proof. (5.3.1) implies that geodesics can be indefinitely extended. One simply 
uses the left hand side of (5.3.1) to define the right hand side. □ 

The Hopf-Rinow Theorem 1.4.8 implies 

Corollary 5.3.2 In a symmetric space, any two points can be connected by 
a geodesic. □ 

By Lemma 5.3.1, the operation of a p on geodesics through p is given by a 
reversal of the direction. Since by Corollary 5.3.2, any point can be connected 
with p by a geodesic, we conclude 

Corollary 5.3.3 <j p is uniquely determined. □ 

Definition 5.3.1 Let M be a symmetric space, c : M — > M a geodesic. The 
translation along c by the amount t £ R is 

T t '■= a c(t/2) ° CC(0)- 


By Lemma 5.3.2, r t thus maps c(s) onto c(s + t), and Dr t is parallel 
transport along c from c(s) to c(s + t) . 
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Remark. r t is an isometry defined on all of M. r = r t maps the geodesic c 
onto itself. The operation of r on geodesics other than c in general is quite 
different, and in fact r need not map any other geodesic onto itself. One may 
see this for M = S n . 

Convention For the rest of this paragraph, M will be a symmetric 
space. G denotes the isometry group of M. Go is the following subset of G : 

Go : ={gt for tgR, where s i— > g s 

is a group homomorphism from K to G }, 

i.e. the union of all one-parameter subgroups of G. (It may be shown that 
Go is a subgroup of G.) 

Examples of such one-parameter subgroups are given by the families of 
translations (r t ) te R along geodesic lines. 

Theorem 5.3.1 Go operates transitively on M. 

Proof. By Corollary 5.3.2, any two points p,q G M can be connected by a 
geodesic c; let p = c(0), q = c(s). If (r t ) t6R is the family of translations along 
c, then 

q = r s {p). 

We thus have found an isometry from Go that maps p to q. □ 

Definition 5.3.2 A Riemannian manifold with a transitive group of isome- 
tries is called homogeneous. 

Theorem 5.3.2 The curvature tensor R of M is parallel, VI? = 0. 

Proof. Let c be a geodesic, and let X, Y, Z, W be parallel vector fields along 
c, p = c(t 0 ),q = c(to + t). Then q = T t fp) and 

( R(X(q),Y(q))Z(q),W(q )) = (R(d.T t X(p), dT t Y{p))dT t Z{p), dT t W{p)) 

by Lemma 5.3.2 
= (R(X(p),Y(p))Z(p),W(p)) 
since Tt is an isometry. 

Let now v := c(fo)- The preceding relation gives 

v(R(X,Y)Z,W) = 0, 
and since X , Y, Z, W are parallel, 

((V v R)(X,Y)Z, W) = 0. 

Since V V R like I? is a tensor, (X7 V R)(X, Y)Z depends only on the values of 
X, Y, Z at p. Since this holds for all c, X , Y, Z, W we get VI? = 0. □ 
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Definition 5.3.3 A complete Riemannian manifold with S7R = 0 is called 
locally symmetric. 

Remark. One can show that for each locally symmetric space N there exist 
a simply connected symmetric space M and a group r operating on M 
discretely, without fixed points, and isometrically, such that 

N = M/r. (5.3.2) 

Conversely, it is clear that such a space is locally symmetric. Examples are 
given by compact Riemann surfaces of genus g > 2 which may be realized as 
quotients of the hyperbolic plane H 2 . Let us also introduce different examples, 
the so called lens spaces: 

We consider S 3 as unit sphere in C 2 : 

S 3 = {(z 1 ,z 2 ) e C 2 : |« 1 | 2 + |« 2 | 2 = 1}. 

On S 3 , we then have an isometric action of the torus S 1 x S 1 , namely 

(z\z 2 ) i — ► (e ilpl z 1 , e i,p2 z 2 ) for 0 < <p\ip 2 < 2ir. 

Let now }>,<;{! N be relatively prime with 1 < p < q. 

Let Z q be the cyclic group of order q. We then obtain a homomorphism 

Z q -> S' 1 x S 1 

r i—> (e 2 ' Kir t q , e 2nipr / q ). 

Thus, Z q operates isometrically on S 3 . Since p and q are relatively prime, 
this operation has no fixed points, and the lens space 

L(q,p ) :=S 3 /Z q 

is a manifold. 

Actually, L( 2, 1) is not only locally symmetric, but symmetric. More pre- 
cisely, L( 2, 1) is the three dimensional real projective space. 

For q > 2, however, the lens spaces are not symmetric. For example, the 
involution at p = (1, 0) € -S' 3 is given by 

a p (z\z 2 ) = (z\-z 2 ) 

(in our complex notation) (recall the definition of S d at the beginning of this 
paragraph). a p therefore does not commute with the Z q action. Therefore, 
the involution a p does not carry over to L(q,p). Since on the other hand 
each involution is already determined by its operation on the tangent space 
and since an involution would have to operate in the same way as a p on the 
tangent space of the point corresponding to p in the lens space, the lens space 
cannot possess any such involution and hence cannot be symmetric. 
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We now want to determine the Jacobi fields on (locally) symmetric spaces. 
For a Riemannian manifold N, p £ N,v £ T p N we define an operator 

R v : T p N -> T p N 

by 

R v (w) = R(w, v)v. (5.3.3) 

For a geodesic c, Rt(t) maps the orthogonal complement of c(t) in T C ^N 
onto itself. 

The operator R^t) is self-adjoint. This follows from (3.3.10) and (3.3.9) 
or (3.3.7): 

(R v (w),w') = (R(w,v)v,w') = (R(w',v)v,w) = (R v (w'),w). 

Since R is parallel for a locally symmetric space, Rc(t) commutes with parallel 
transport along c. 

Let v be an eigenvector of i?c(o) with eigenvalue p with ||i>| = 1, ||c(0)|| = 1 
(this can be achieved by reparametrization) , and 

(u,c(0)) = 0, 


i.e. 

i2(v,c(0))c(0) = Rc( o)(n) = pv. 

Let v(t) be the vector field obtained by parallel transport of v along c. Then 
v(t) is an eigenvector of i?c(t) with eigenvalue p, since R is parallel. Thus 

R(v(i),c(t))c(t) = pv(t). (5.3.4) 

(5.3.4) implies that the vector fields 

J\{t) := c p (t)v(t) and J^t) := s p (t)v(t) (5.3.5) 

(c p and s p defined as in 4.5) satisfy the Jacobi equation: 

J i (t) + R(J i (t),c(t))c(t) = 0 (* = 1,2). (5.3.6) 


Thus 

Theorem 5.3.3 Let N be a locally symmetric space, c geodesic in N, 
c(0) =: p, Vi,...,v n -i an orthonormal basis of eigenvectors of i?c(o) or- 
thogonal to c(0) with eigenvalues p\, . . . , p n - i, v\ (t), . . . , v n -\(t) the parallel 
vector fields along c with Vj{ 0) = Vj (j = 1, . . . ,n — 1). The Jacobi fields 
along c (orthogonal to c) then are linear combinations of Jacobi fields of the 
form 


c Pj {t)vj{t) and s Pj (t)vj(t). 


(5.3.7) 

□ 
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Definition 5.3.4 Let g be the Lie algebra of Killing fields (cf. Lemma 1.6.8) 
on the symmetric space M, and let p G M. We put 

t := {X G g : X(p) = 0} 
p := {X G g : VI(p) = 0}. 


Theorem 5.3.4 B ® p = g, B fl p = {0}. 

Proof. B fl p = {0} follows from the facts that each Killing field is a Jacobi 
field (Corollary 4.2.1) (along any geodesic) and that Jacobi fields that vanish 
at some point together with their derivative vanish identically (by Lemma 
4.2.3) and finally that by Corollary 5.3.2 any two points can be connected 
by a geodesic. Let now X G g with X(p) ^ 0. Let c(t) := exp p tX(p) be the 
geodesic with c(0) = X(p), and let r t be the group of translations along c 
(Definition 5.3.1). Then 

Y (q) ~ j t n{q)\t=o (5-3.8) 

is a Killing field, since the r* are isometries (Lemma 1.6.7). 

We have 

Y{p)=X{p). (5.3.9) 

For v G T p M, let j(s) be a curve with 7 ; (0) = v. Then 

V v Y(p) = V^^r t (7(s))| s=t=0 

d 

= v lfe T ‘W s ))i»=« 

= V a Dr t (v) | t=0 

dt 1 

= 0, (5.3.10) 

since by Lemma 5.2.2 Dr* is parallel transport along c, and hence Drt(v) is 
a parallel vector field along c. 

We conclude 

X = (X ~ Y) + Y with (X - Y) G t by (5.3.9), 

Y G p by (5.3.10) . □ 

Theorem 5.3.5 As a vector space, p is isomorphic to T p M. The one- 
parameter subgroup of isometries generated by Y G p is the group of transla- 
tions along the geodesic exp tY(j>). 

Proof. Let w G T p M. Let c(t) := exp tw be the geodesic with c(0) = w. Let 
T t be the group of translations along c. As in (5.3.8), we put 
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Y(q) ■= ^r t (g)| t=0 for all q G M. (5.3.11) 

As in the proof of Theorem 5.3.4, we obtain 

Y(p) = w and hep. 

This induces a linear map from T p M to p. The inverse of this map is simply 
the restriction mapping Y G p to Y(p). Thus, we have found a bijective linear 
map between T p M and p. By (5.3.11), Y also generates the one-parameter- 
subgroup Tt (surjectivity follows from the proof of Theorem 5.3.4). □ 

Let us introduce the following notation: For a Killing field we denote the 
(at this point only local) 1-parameter group of isometries generated by X by 
e tx (instead of the previous notation if t or ip t ). 

Lemma 5.3.3 Let X be a Killing field on the symmetric space M. Then e tx 
is defined for all tGR. Thus, (e tA ') te R is a 1-parameter- group of isometries. 

Proof. Let q G M. We want to show that e tx (q) is defined for all tGR. We 
shall show that this is true for t > 0, since the case t < 0 is analogous. Let 
now 

T := sup{f G R : e TX (q) is defined for all r < t}. 

We assume T < oo and want to reach a contradiction. 

We put 

m := sup {d{q,e tx {q)) : t < T/ 2}. 

Since each g G G is an isometry, we have for all x,y G M 

d(gx, gy) = d(x,y). 

Hence also 

d{g 2 q 1 gq ) = d(gq,q), 

and thus 

d(g 2 q,q ) < 2 d(gq,q). 

Therefore for 0 < t < T 

d(e tX (q) 1 q)<2d(e t / 2X (q),q)<2m. 

Therefore, for all 0 < t < T, e tx (q) is contained in 

B(q, 2m), 

which is a compact set. 

As in the proof of Corollary 1.4.3, we see that there exists e > 0 with 
the property that for all x G B(q, 2m) e tx (x) is defined for |f| < e. Thus, for 
r := T — s/2, 

e eX (e rX (q)) = e^ T+s/2)x (q) 
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is defined. 

This contradicts the assumption on T and proves the claim. □ 


For Y G p, we thus obtain from Theorem 5.3.5 

e tY = t u (5.3.12) 

where (r t ) is the family of translations along the geodesic exp tY(j>). 

We now define a group homomorphism 

s p :G-^G 

by 

Sp(g) = cr p o g o a p , (5.3.13) 

where <j p : M —> M is the involution at p. Since a p = id, we have 

s P (g) = cr p o g o a- 1 . (5.3.14) 

We obtain a map 

9p : 0 0 

by d 

0 P (X) := j t s P (e tx ) | t=0 . (5.3.15) 

Theorem 5.3.6 9 p | t = id, 9 p \ p = — id. 

Proof. Let X € 4, i.e. X(p) = 0. Then for all t 

e tx { p )=p. (5.3.16) 

Let Ci be a geodesic with C\ (0) = p. Then for all t, 

c 2 (s) := e tx a (s) 

likewise defines a geodesic through p, i.e. C2(0) = p. It follows that 

s p (e tx )ci(s) = Up o e tx o a p ci(s) 

= (j p o e tx c\{— s) by Lemma 5.3.1 

= CTpC 2 (-S ) 

= C2(s), 

i.e. s p (e tx )c 1 (s ) = e tx c 1 (s). 

Since each q £ M can be connected with p by a geodesic (Corollary 5.3.2), 
we obtain 


s P (e tX )(q) = e tX {q) 
for all q £ M, i.e. s p (e tx ) = e tx , and hence also 


i.e. 9 p \t = id. 5 


9 P (X) = X, 


5 


One may easily modify the proof at this place so as to avoid using the completeness 
of M. 
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Let now L € p. From (5.3.12) (cf. Theorem 5.3.15), 

e tY = Tt = & c (t/ 2 ) 0 <?p by Definition 5.3.1, 
where c(t) = exp p tY (p) . Hence 

/ t \ 

Sp\C ) = &p ° &c(t/2) ° &p ° &p 

= <jp o <T c (t/ 2 ) because of a p = id 

= r-t, 

which may be seen e.g. as follows: Let q = c(t/2),c(s) = c(t/2 — s). Then 
p = c(f/2),c(0) = q, hence 

(jp o cr c (t/2) = cr £ (t/2) o crg (0) . 

Therefore, this is the translation along c by the amount t. Since c is traversed 
in opposite direction as c, this is the same as translation along c by the amount 
—t. 

Since 

T—t = 

it follows that 

s P (e tY ) = e~ tY 

hence 

o p (y) = - y , 

i.e. 

— id- n 


Lemma 5.3.4 9 P [X, Y] = [0 p X, 0 p F] for all X,Y £ 0 . Thus, 9 P is a Lie 
algebra homomorphism. 


Proof. By definition of 9 P (5.3.15), 9 P (X) generates the 1-parameter group 

gt&p(X) ^ | 

S p(e tx ) = e td » (x) . (5.3.17) 

Now 


lX,Y} = -De- tx oYoe tx \ t = 0 


& 2 

d ids 


e~ tx e sY e tx 


t—s—0 • 


cf. Theorem 1.6.4 (ii) 


(5.3.18) 
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Hence 


9 P [X,Y\ = ^—a p e tx e sY e tx a p{t=s=0 


dtds 

d 2 

dtds 

& 2 

<9t<9s 

a 2 


Cp€ (Jp (JpC (7p <Jp€ CTp\t—s=0 


s P {e tx )sp(e sY )s p (e tx )\ t = s -o (cf. (5.3.14)) 

e -tOp(X) e s6 p (Y) e t9 p (x)^^ by ( 5 3 _ 17 ) 


dtds 

= [9 P (X),9 P (Y)} by (5.3.18). 


□ 


Theorem 5.3.7 [1, t] C 1, [p, p] C 4, [1, p] C p. 

Proof. Because of 9 p = id, 9 P has eigenvalues —1 and 1. By Theorem 5.3.6, 
1 is the eigenspace with eigenvalue 1 , p the eigenspace with eigenvalue — 1 
(note that by Theorem 5.3.5, g = t©p). If X is an eigenvector with eigenvalue 
A, Y one with eigenvalue /./,, then, since 9 P is a Lie algebra homomorphism 
(Lemma 5.3.3), [X, Y] is an eigenvector with eigenvalue A/i. This easily gives 
the claim. □ 

Corollary 5.3.3 t is a Lie subalgebra of g. 

Proof, t is a subspace of g and closed w.r.t. the Lie bracket by Theorem 5.3.7. 

□ 


Corollary 5.3.4 With the identification 

T p M ~ p from Theorem 5.3.5, 
the curvature tensor of M satisfies 

R(X, Y)Z(p) = -[[X,Y],Z)(p) (5.3.19) 

for X,Y,Ze p. 

Proof. Let X G g,Y G p. The geodesic exp p tY(p) satisfies 

Y (c(t)) = c(t) for all t € M. 

This follows e.g. from Theorem 5.3.5. 

Since by Corollary 4.2.1, X is a Jacobi field along c, we obtain 

V y V y X + R(X,Y)Y = 0 (5.3.20) 


along c, hence in particular at p. 
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This implies that we have also for Y, Z € p, since then also Y + Z € p, 
that 

VyV z T + V z V y l + R(X, Y)Z + R(X, Z)Y = 0 (5.3.21) 

at p. 

Now by (3.3.7) 

R(X, Z)Y = —R(Z, X)Y, (5.3.22) 

by (3.3.8) 

R{X, Y)Z + R(Y, Z)X + R(Z , X)Y = 0 (5.3.23) 

and by (3.3.3) 

R{Y,Z)X = \/ Y \/ z X - XzYIyX - Vy,z\X. (5.3.24) 

By Theorem 5.3.7, for Y, Z G p, \Y, Z) £ t, hence 

[Y,Z](p) = 0. (5.3.25) 

(5.3.21) - (5.3.25) imply 

XyV z X + R{X,Y)Z = 0 (5.3.26) 

at p. 

By (5.3.23) and (5.3.22) for X,Y,Z £p 

R(X, Y)Z(p) = -R(Y, Z)X(p) + R(X, Z)Y(p) 

= V z V x Y(p) - VzVyX(p) by (5.3.26) 

= V z [X,Y](p) 

= X7 [x ,y ] Z(p)-[[X,Y],Z)(p) 

= -[[X,Y],Z}{p) 

because of [X, Y](p) = 0 (Theorem 5.3.7). □ 

Corollary 5.3.5 The sectional curvature of the plane in T p M spanned by 
the orthonormal vectors Yi(j>),Y 2 {p) (Yi,Y 2 G p) satisfies 

K(Y 1 (p) A Y 2 (p)) = -([[YyY^Y^Y^ip). 


Proof. From (5.3.19). 


□ 
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5.4 Some Results about the Structure of Symmetric 
Spaces 


In this paragraph, we shall employ the conventions established in the previous 
one. 

Let us first quote the following special case of a theorem of Myers and 
Steenrod: 

Theorem. The isometry group of a symmetric space M is a Lie group, and 
so is the group Go defined in 5.3. Moreover g is the Lie algebra of both G and 

G 0 . 


A proof may be found, e.g., in [116]. □ 

Technically, this result will not be indispensable for the sequel, but it is 
useful in order to gain a deeper understanding of symmetric spaces. 

We now start with some constructions that are valid not only for the 
isometry group of a symmetric space but more generally for an arbitrary Lie 
group G with Lie algebra denoted by g. 

Each h £ G defines an inner automorphism of G by conjugation: 

Int (ft) :G->G 

g i— > hghr 1 . 

Putting h = ( 7 p , here we obtain s p from 5.3. 

0 as a Lie algebra in particular is a vector space, and we denote the group 
of vector space automorphisms of g by Gl( g). 

Definition 5.4.1 The adjoint representation of G is given by 

Ad : G -> Gl{ g) 
h i— > D e Int (ft) 

where e £ G is the identity element. 

In the notations of 5.3 we thus have 

0 p = Ad(<r p ). (5.4.1) 

Lemma 5.4.1 Ad is a group homomorphism, and for each h £ G, Ad ft. £ 
Gl(g) is a Lie algebra homomorphism, i.e. 

Ad h[X, Y] = [Ad hX, Ad hY] for all X,Y eg. (5.4.2) 


This result generalizes Lemma 5.3.4. 
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Proof. That Acl is a group homomorphism follows from 
Int(/ii/i 2 ) = Int(/ii)Int(/i 2 )- 

That Acl ft, is a Lie algebra homomorphism follows as in the proof of Lemma 
5.3.4. □ 

Definition 5.4.2 The adjoint representation of g is given by 

ad : g -> gl(g) 

X i-> (D e Acl) (A) 

where g[(g) is the space of linear self maps of g. 

Lemma 5.4.2 

(adX)Y = -[X,Y\. (5.4.3) 


Proof. 


(ad X)Y 


-D e lnt(e tx )Y lt=0 


dt 

d 2 


Int(e tA )e sy | t=8=0 


dtds 

[— X, Y] by Theorem 1.6.4 (ii). 


□ 


Corollary 5.4.1 

(ad X)[Y,Z\ = [(ad X)Y,Z\ + [Y,(&dX)Z\. 

Proof. From Lemma 5.4.2 and the Jacobi identity (Lemma 1.6.5). □ 

Corollary 5.4.2 e ac ^ A = Acle A for all leg. 

Proof. A e a dt *| i=0 _ adX = (D e Ad)X = ^ Ade tx \ t =o, which easily im- 
plies the claim. □ 

Definition 5.4.3 The Killing form of g is the bilinear form 

B : g x g -» R 

(X, Y) i — ► tr(adX oadF). 

0 (and likewise G ) is called semisimple if the Killing form of g is nondegen- 
erate. 



266 5. Symmetric Spaces and Kahler Manifolds 


Lemma 5.4.3 The Killing form B of g is symmetric. B is invariant under 
automorphisms of g. In particular 

B((Adg)X, (Ad g)Y) = B{X, Y) for all X, Y £ g, g £ G. (5.4.4) 
Moreover 

B((&dX)Y,Z) + B(Y,(&dX)Z) =0 for all X,Y,Z £ g. (5.4.5) 

Proof. The symmetry of B is a direct consequence of the formula 

tr (AC) = tr (CA) (5.4.6) 

for linear self maps of a vector space. 

Let now a be an automorphism of g. Then 

(ad aX)(Y) = [a(-X),Y\ by (5.4.3) 

= [er(— X),er a~ l Y] = a[—X,a ~ X Y] 

= (cr o adX o a~ 1 )(Y). 

Therefore 

tr(ad <jX ad aY) =tr(cradXad Y a -1 ) =tr(adXadT) with (5.4.6), 

i.e. 

B(aX, aY) = B{X, Y). (5.4.7) 

Therefore, B is invariant under automorphisms of g. We now choose 

a = Ad(e tx ). 

Differentiating (5.4.7) w.r.t. t at t = 0 yields (5.4.5). □ 

We also define 

K \= {g G G : g(j>) = p}. 

I\ then is a subgroup of G. For X £ 6, we have e tx £ K. 

We now have two scalar valued products on p. Namely, for Y, Z £ p, we 
may form (Y(p),Z(p)), where (., .) denotes the Riemannian metric of M, as 
well as 

B(Y, Z). 

We now want to compare these two products. 

Lemma 5.4.4 Ad K leaves p and the product (•,•) on p invariant. 


Proof. Since for k £ K,k(p) = p, for Y £ p, Int(fc) maps the geodesic 
exp p tY (p) through p onto another geodesic through p, and this geodesic is 
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generated by DkoY(k 1 (p)) = DkY(p). Therefore, (Ad k)(Y) = DkoY(k 1 ) 
is in p as well (cf. the proof of Theorem 5.3.6). Moreover, for Y, Z € p 

(Y (p),Z(p)) = ( Dk o Y ( p),Dk o Z(p)), since k is an isometry 

= (Dk o Y(k~ 1 (p)), Dk o Z(k~ 1 (p))) sincefc _1 (p) =p 
= (Ad kY(p), Ad kZ(p)). □ 


Corollary 5.4.3 The Killing form B is negative definite on t. 


Proof. Let X £ t,Y, Z £ p. By Lemma 5.4.4 

(Acl(e tA )F(p), Ad (e tx )Z(p)) = ( Y(p),Z(p )). (5.4.8) 

We differentiate (5.4.8) at t = 0 w.r.t. t and obtain 

(ad (X)Y(p), Z(p)) + ( Y(p ), ad (X)Z(p)) = 0. (5.4.9) 


By Theorem 5.3.7 or Lemma 5.4.4, ad X yields a linear self map of p, and 
by (5.4.9), this map is skew symmetric w.r.t. the scalar products (-,-)(p) on 
p. We choose an orthonormal basis of p w.r.t. (-,-)(p) and write ad X = 
( a ij)i,j=i,...,n as a matrix w.r.t. this basis. Since ad X is skew symmetric, we 
have 

atj = - aji for i, j = 1, . . . , n. 


Therefore 

n o 

B(X, X) = trad A o ad X = — E ajp 

i,i — l 

and negative definiteness follows, since for I £ !,I / 0, also ad I / 0 
because otherwise Ad e tx = id, hence by e tx £ K, De tx would be the 
identity of T p M , i.e. e tA ", i.e. X = 0. □ 


We now define the following scalar product on g : 

For Y, Z £ p 

(Y, Z) B := (Y (p), Z(p)), where the scalar product on the right hand 

side is the Riemannian metric on T p M , 

for X,W £t 

(X,W) Q := —B(X, W) 

for X G t, Y £ p 

(X,Y) a := 0. 

Lemma 5.4.5 (•,•) is positive definite and Ad K -invariant. 

Proof. Positive definiteness follows from positive definiteness of the Rieman- 
nian metric on T p M and Corollary 5.4.3. Ad A'-invariance follows from Lem- 
mas 5.4.3 and 5.4.4. □ 
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The infinitesimal version of the Ad Jt-invariance of (■, •) is 

((adX)F,Z) 0 + (y,(adX)Z) fl = O for Y,Z £g,X£t (5.4.10) 
For Y £ p, we now consider the linear functional 

p -> M 

X i-> B(X, Y), 

where B again denotes the Killing form of g. Then there exists Y* £ p with 

B(X,Y) = (X,Y*) g . 

Since B is symmetric (Lemma 5.4.3), the map 

P -*• P 

is self adjoint w.r.t. (■,■). Therefore, there exists an orthonormal basis 
Yi , . . . , Y n of eigenvectors: 

Y* = XjYj (j - 1 n). 

Then 

B{Yi,Yj) = (Y,Y*) = X j (Y i , Yj) 

= {Y j ,Y*) = \i{Y i ,Y j ). 

Thus, eigenspaces of different eigenvalues are orthogonal not only w.r.t. 

(•, •) , but also w.r.t. B. We write the decomposition of p into eigenspaces 
as 

P = Pl ® . . . © p m- 

The eigenvalue of p j is denoted by /. ij (J = 1, . . . m). 

Lemma 5.4.6 

[Pi, Pj] = 0 fori^j. (5.3.11) 

If g is semisimple, i.e. B nondegenerate, then 

(■> ') 0 = ~B\t + — B \pi + ■ • • + —B\ Pm . (5.4.12) 

pl pm 


Proof. Let 1) £ pi,Yj £ p j. Then 


B{[Yi,Yj\, \Yi, Yj}) = —B(Yj, [Y, [Y^Yj]}) by (5.4.3), (5.4.5) 
= -^{Y j ,[Y i ,[Y i ,Y j \]) 

= - N {Y i ,[Y j ,[Y j ,Y i ]]), 
for example by Corollary 5.3.5 and by 
the symmetries of the curvature tensor. 
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In the same manner, however, we also obtain 

[Y^Yj]) = -miYi, [Yj, [Yj^Yi]}), (5.4.13) 

and hence, since B is nondegenerate, we must have 

[Yi,Y j }= 0. 

Namely, by Theorem 5.3.7, [Y,, Yj\ £ 6, and by Corollary 5.4.3 B is negative 
definite on 6. That the restriction of (•,•) onto 6 coincides with — B\ t is a 
consequence of the definition of (•,•). Moreover, for Y,Z £ pj 

B(Y,Z)= N (Y,Z), 

and p, and pj for i ^ j are orthogonal w.r.t. (•, •) and B. This implies (5.4.12), 
because, since B is nondegenerate, all pj must be ^ 0. □ 

Definition 5.4.4 Let 0 = t © p be the usual decomposition of the space of 
Killing fields of the symmetric space M. 

M is called of Euclidean type, if 

[M = 0 , 

i.e. if the restriction of the Killing form vanishes identically on p. 

M is called semisimple, if g is semisimple. 

M is called of compact (noncompact) type, if it is semisimple and of 
nonnegative (nonpositive) sectional curvature. 

Corollary 5.4.4 A semisimple symmetric space is of (non) compact type if 
and only if B is negative (positive) definite on p. 

Proof. Since B is negative definite on p, all /q are < 0, and Corollary 5.4.3 
and (5.4.13) imply 

~(Yi, [Yj, [Yj, Yc]]) > 0, 

hence K > 0 by Corollary 5.3.5. If conversely K > 0, B must be negative 
definite on p, because otherwise we would contradict (5.4.13), since by Corol- 
lary 5.4.3 B([Yi,Yj], \Yi,Yj}) < 0. The case K < 0 is analogous. □ 


Perspectives. Symmetric spaces were introduced and investigated by E. Car- 
tan. They form a central class of examples in Riemannian geometry, combin- 
ing the advantage of a rich variety of geometric phenomena with the possibility 
of explicit computations. Moreover, symmetric spaces can be completely classi- 
fied in a finite number of series (like S n = SO (n + l)/SO(n), hyperbolic space 
H n = SOo(n, l)/SO(n), CP" = SU(n+l)/S(U(n)xU(l)), Sl(n,R)/SO(n),Sp(p + 
q)/Sp(p) x Sp(q), etc.) plus a finite list of exceptional spaces. Moreover, there ex- 
ists a duality between the ones of compact and of noncompact type. For example, 
the dual companion of the sphere S n = SO (n + l)/SO(n) is hyperbolic space 
H n = SOo(n, l)/SO(n). A reference for the theory of symmetric spaces is Helga- 
son[116]. 
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5.5 The Space Sl(n, R)/SO(n, R) 

We now want to consider examples: In fact, we shall specialize the examples 
of §1.7 to the case where we identify the vector space with R". 

Let M n be the space of ( n x n)-matrices over R (M n ~ R™ ) , 

Gl(n,R) := {A G M n : clet A y 0} (linear group) 

Sl(n,R) := {A G M n : clet A = 1} (special linear group) 

SO(?z) := SO(n,R) := {A G M n : A = A _1 , det A = 1} (special orthogonal 
group) . 

(Note that A * is the adjoint A* of A w.r.t. the Euclidean scalar product). 
Obviously, these are Lie groups. 

0 [(n,R) := M n 

when equipped with the Lie bracket 

[X, Y] := XY - YX 

becomes a Lie algebra, and so do 

s((n,R) := {X G M n : trX = 0} 
so(n) := so(n,R) := {X G M n : X* = - X }; 

these are the Lie algebras of Gl(n,R),Sl(n,R),SO(n, R). As in §1.7, one ver- 
ifies this by considering for X G 0 [(n,R) the exponential series 

e tx := Id + tX + jX 2 + ... . 

We have, 

det(e x ) = e tr ' Y , (5.5.1) 

as is easily seen with the help of the Jordan normal form. 

In particular, for all t G R 

e tx G Gl(n,R). 

By (5.5.1), if X G sl(n, R), then e x G Sl(n, R). Moreover, for X G so(n,R) 
(e x Y = Id + X* + ^(X 4 ) 2 + . . . = Id - X + i(X) 2 - . . . = e" x = (e x )“\ 
i.e. e x G SO(n, R). 

The series representation of e tx also easily implies that the derivative of 
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at A = 0 is the identity; note in particular that gl(n, R) and Gl(n, R) are of 
the same dimension. Therefore, the exponential map A i— > e x is a diffeomor- 
phism in the vicinity of X = 0. 

The exponential map then also yields a diffeomorphism between neigh- 
borhoods of 0 in sl(n, R) and so(n), resp., and neighborhoods of Id in Sl(n, R) 
and SO(n), resp., because the corresponding spaces again have the same di- 
mension. 

From §1.7, we recall that for A, B £ Gl(n,R), 

Int (A)B = ABA- 1 . 

Therefore, for X € gl(n, R) 

(Ad A)X = j t Ae tx A~ l | t=0 = AAA -1 (5.5.2) 

and for Y € gl(n, R) then 

o 2 

(ad Y)X = —e sY e tx e~ sY \ s = t=0 = YX — XY = [F X], (5.5.3) 

We now let E 11 £ m n be the matrix with entry 1 at the intersection of the 
i th row and the j th column and entries 0 otherwise, E l i = {eVAk,e=i,...,n- 
Then with X = ( xu),Y = (y ke ) 

ad X ad Y E lJ = {x k eyim^ h - xue^ m y mh - yue^Xmh + e^ e xe m y m h)k,h=i,...n 
and hence 


tr ad A ad F = {E ij , ad X ad YE ij ) 

= nxij'yji - Xuyjj - yuXjj + nXjiyij (5.5.4) 

= 2n tr XY — 2 tr A tr F. 

If A = A Id (Id = identity matrix) , then 

ad A = 0. 

Therefore, gl(n, R) is not semisimple. On sl(n, R), however, the Killing form 
satisfies by (5.5.4) 

B(X,Y) = 2?rtr AF. (5.5.5) 

Therefore, for A/0 

B( A,A*)>0, (5.5.6) 

and the Killing form is nondegenerate. 

A similar computation applies to so(?r) : so(2) = R is not semisimple. For 
n > 2, we choose {^{E 1 - 1 — E^) : i < j} as a basis for so(n). Then 

( -4 (E ij - E ^). , (E kC -E ik )) = 6 ik S je for i<j,k<£ 
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and 

tr ad A ad F = (2=( E ij - E ji ),adX a,dY^(E ij - E ji )) 
v 2 v 2 

1 kpy tk T % ijUij 

using tr A = trF = 0 for A, Y £ so(?i). 

Since X = —X t for X £ so (n), we obtain 

tr ad X ad Y = (n — 2) tr X ■ Y. 

In particular, for n > 2 let B now denote the Killing form of so(n), then for 
X^O 

Bpf,X 4 ) < 0, 

and also 

B(X,X) < 0. 

Thus, the Killing form of so(n) is negative definite for n > 2. Note that the 
Killing form of so(n) does not coincide with the restriction of the Killing form 
of sl(n, K) onto so(n). In the sequel, we shall employ the latter one. 

(5.5.5) directly implies that B is Ad (Sl(n,R)) invariant. We now put 

G = Sl(n,M),K = SO(n), 

g = sI(n,M),t = so(n),p = {X € sl(n,R) : X 4 = X}. 

Then because of X = \{X - X*) + \{X + A*), 

0 = I®p. (5.5.7) 

Moreover, because of (XY — FA) 4 = Y t X t — X t Y t 

[t,t] Ct,[p,p] Ct,[t,p] Cp. (5.5.8) 

Next, let 

M := G/K\ 

more precisely, M is the space of equivalence classes w.r.t. the following 
equivalence relation on G : 


3i ~ 32 : 3k£ K : g 2 = g\k. 

Thus, M is the space of left cosets of K in G. As K is not a normal subgroup of 
G, M is not a group. We want to equip M with a symmetric space structure. 
G operates transitively on M by 


Let 


g'K i— > gg'K for g £ G. 
tt : G —> M 


be the projection. 
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A subset 1 ? of M is called open, if is open in G. Then 7 r becomes 

an open map. 

We want to show that M is a Hausdorff space. The preimage of I\ under 
the continuous map 

GxG^G 

(51,52) !->■ 5 i _1 52 

is closed since K is closed in G. Thus, if gf 1 g 2 K, in G x G there exists 
a neighborhood of (51,52) of the form G\ x J?2 which is disjoint from the 
preimage of K. If now g\K 7 ^ gi K, then gf g 2 fL K, and 17 := tt (17;), i = 1,2, 
are disjoint neighborhoods of g\K and g^K. Namely, if gK £ 17;, there exists 
ki £ K with gki £ f2i, and if we had gK £ fii fl 1 ? 2 , {gki, gk^) would be 
mapped to k^ 1 k 2 G K, and 17 1 x G 2 would not be disjoint to the preimage 
of K. This shows the Hausdorff property. 

In order to construct coordinate charts, we first have to recall the Cauchy 
polar decomposition of an invertible matrix. 

Lemma 5.5.1 For A £ Gl(n, K), there exist an orthogonal matrix R and a 
symmetric positive definite matrix V with 

A = VR, 


and this decomposition is unique. 

Proof. Since A is invertible, 

H := AA f 

is symmetric and positive definite. We are going to show that there exists a 
unique symmetric, positive definite matrix V with V 2 = H. For this purpose, 
we first observe that H may be diagonalized by an orthogonal matrix S : 

H = S t AS with yl = diag(A.j), \i > 0 by positive definiteness. 

We put 

V :=S* diag( V / A)^. 

V then is symmetric, positive definite, and because of 5* = S' -1 , it satisfies 

V 2 = H. 

This shows existence. For uniqueness, we first show that for a symmetric, 
positive definite matrix V, each eigenvector of V 2 with eigenvalue A is an 
eigenvector of V with eigenvalue As. Namely, from V 2 x = \x it follows that 

(V+ \/AId)(V- \/AId)a: = 0, 

and therefore we must have y := (V — -\/AId)a; = 0, because otherwise y 
would be an eigenvector of V with eigenvalue — -\/A < 0, contradicting the 
positive definiteness of V. 
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This implies that the relation V 2 = H uniquely determines V, because all 
eigenvalues and eigenvectors of V are determined by those of H. 

We now put 

R = U _1 A 

Then 

RR* = r 1 ,1.4' V' 1 = V-^V- 1 = Id, 
and R is orthogonal. 

This shows the existence of the decomposition. Uniqueness is likewise 
easy: 

If 

A = VR, with orthogonal R and with symmetric, positive definite V, 
then 

A A* = VRJAV 1 = V 2 , 

and by the preceding, this uniquely determines V. R then is unique as well. 

□ 


Let 

P:={Ae SI (n,R) : A* = A, A pos. def.} 

(Note that P is not a group.) 

For X G p, then 

e x G P 

and the exponential map again yields a diffeomorphism between a neighbor- 
hood of O in p and a neighborhood of Id in p. We now decompose A G Sl(n, R) 
according to Lemma 5.5.1 

A = VR 

with R G O(n), V G P. 

Let A be contained in a sufficiently small neighborhood of Id. 

There then exist unique 

X G so (n),Y G p 

with 

e* = R, e Y = V. 

This implies the existence of neighborhoods fl\ of 0 in p, of 0 in so(n) for 
which 

f? i x — * G 

(Y, X) ^ e Y e x 

is a diffeomorphism onto its image. 

Lemma 5.5.2 G/K is homeomorphic to P. If G/I\ is equipped with the 
differentiable structure of P, 
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exp : p — > G/K ~ P, V ^ e v 

becomes a local diffeomorphism between a neighborhood of 0 in p and a neigh- 
borhood of Id K in G/K. 

Proof. We first construct a homeomorphism d> between G/K and P. For gK 
we write by Lemma 5.5.1 

g = VR with V € P,R € SO(n) 


and put 


<%) = V. 


This does not depend on the choice of representative of gK. Namely, if gK = 
g'K , there exists S £ SO(n) = K with gS = g' , hence g' = VRS = VR' with 
R' := RS £ SO(n), and <P(g') = V = d>(g). If conversely d>(g) = d’(g') =: V, 
then g = VR, g' = VS with R, S £ SO(n), hence g' = g(R~ 1 S) with R~ 1 S £ 
SO(n), hence gK = g'K. Therefore, d> is bijective. d> is continuous in both 
directions, because 

7r : G — > G/K 


and 

7r : G — * P 

A^V, 

with A=VF (the unique decomposition of Lemma 5.5.1), both are continu- 
ous and open. 

Moreover exp(p) C P, and since exp : g[(rc,R) — > Gl(n,M) is a local 
diffeomorphism, and p and P have the same dimension, exp| p is a local dif- 
feomorphism, too, between a neighborhood of 0 in p and a neighborhood of 
Id in P. □ 


By Lemma 5.5.2, G/K becomes a differentiable manifold. We have already 
displayed a chart near Id • K. In order to obtain a chart at gK, we simply 
map a suitable neighborhood U of gK via g onto a neighborhood g~ 1 l J of 
Id • K and use the preceding chart. 

G then operates transitively on G/K by diffeomorphisms, 

G x G/K -> G/K 
( h , gK) i-> hgK 

The isotropy group of Id • K is K itself. The isotropy group of gK is gKg _1 , 
and this group is conjugate to K. 

We want to construct Riemannian metrics on G on G/K w.r.t. which G 
operates isometrically on G/K. 

For this purpose, we use the Killing form B of sl(n, R) and the decompo- 
sition g = s[(n,R) = t ® p (with 6 = so (n)). We put 
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( B(X , Y) for X,Y £p 

(X,Y) a :=l-B{X,Y) for X,Y£t 

[ 0 for X £ p, Y £ £ or vice versa. 

By (5.5.5), (-,■) is positive definite. 

For abbreviation, we put 

e := Id (identity matrix) 

and we identify 0 with T e G. For each g £ G, we then also obtain a metric on 
T g G by requesting that the left translation 

L g :G^G 

h i— > gh 

is an isometry between T e G and T g G ( dL g : T e G —> T g G). We also obtain a 
metric on G/K : restricting (-, •) to p, we get a metric on T eK G/K ~ p; the 
metric on T 9 kG/K then is produced by 

L g : G/K -► G/K 
hK i — ► ghK 

by requesting again that those maps are isometries. 

The metric is well defined; namely, if 

gK = g'K , 


then 


g' = gk with k £ K, 


hence L g > = L g o L fc . L fc now maps eK onto itself, and dLk : T e xG/K — > 
T eK G/K is an isometry, since for V £ P, L k V = kV = (kVk~ 1 )k = 
((Intfc)F)fc, hence dL k (X) = (Adfc)X for X £ p ~ T e xG/K, and Adfc 
is an isometry of p because it leaves the Killing form invariant. Therefore, 
the metric on G/K is indeed well defined. By definition, G then operates 
isometrically on G/K. 


We want to define involutions on G/K so as to turn G/K into a symmetric 
space. 

We first have an involution 


with 


hence 


a e :G^G 

h e-> (hr 1 )* 

da e : 0 -> g 
X i — ^ -X*, 

da e \ t = id, d<r e |p = -id, cr e \ K = id. 
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For g £ G, we then obtain an involution 


G-> G 


by 

a g h = L g a e {L g -ih) = g^g^h) -1 )* = gg t {h~ 1 ) t . 

We have 

^(h) = gg t (((gg t (h- 1 Y)- 1 ) t ) = h, 

hence 

a 2 g = id 

and 

°g{9) = 9- 

Since cr e \ K — id, a e induces an involution 


a eK : G/K G/K 


with a e x{eK) = eK,da e x '■ T e xG/K — > T e xG/K , dcr e if = —id. Since G 
operates transitively on G/K, at each gA' G G/K, we then also obtain an 
involution er^ = L g o a e x ° L g -i . 

We have thus shown 


Theorem 5.5.1 G/K carries a symmetric space structure. □ 

The group of orientation preserving isometries of G/K is G itself. Namely, 
that group cannot be larger than G, because any such isometry is already 
determined by its value and its derivative at one point, and G operates transi- 
tively on M = G/K, and so does K on T eK M, and hence G already generates 
all such isometries. 

We want to establish the connection with the theory developed in 5.3 and 
5.4. We first want to compare the exponential map on sl(n, M) and the induced 
map on G/K with the Riemannian exponential map. Let a one parameter 
subgroup of G be given, i.e. a Lie group homomorphism 

ip : R — > G. 

Thus ip(s + t) = (p(s) o (p(t), hence 

cp(t + h) - <p(t) <p(h)- 1 

1 = 

hence 


= o) 


ip(t) = e * dt 


As usual, this implies 
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Thus, the exponential map generates all one parameter subgroups of G. 

If c is a geodesic in G/K with c(0) = eK =: p, the translations r t along 
c yield a one parameter subgroup of G, hence 

exp p tc(0) = c(t) = T t (p ) = e tx {p) for some X T eK G/K = p. (5.5.9) 

Here, on the left, we have the Riemannian exponential map, whereas on the 
right, we have the one of G. Since the derivative of the Lie group exponential 
map at 0 is the identity, we obtain X = c(0), and the two exponential maps 
coincide. In particular, the Lie group exponential map, when applied to the 
straight lines through the origin in p, generates the geodesics of G/K. 

We also obtain a map ip from the Lie algebra sl(?r,R) of Sl(n, R) into the 
Lie algebra of Killing fields of G/K. For X £ sl(n, R) we put 

= ^ge tX (p)\ t= o for q = g(p) 

= 

Now 

iP{XY){q) = dgXY (p) 

= ios ge ' Xe ' r(j,) i— « 

= ^(^)(5e tJf (p))| t =o 
= iP(Y)iP(X)(q) 

hence 

iP([X,Y}) = [i/(Y),iP(X)\ = -[V>(X),V>(F)]. 

We thus obtain an antihomomorphism of Lie algebras. This explains the 
difference in sign between (5.4.3) and (5.5.3). 

Corollary 5.4.1 Sl(n, R)/SO(n) is a symmetric space of noncompact type. 
The sectional curvature of the plane spanned by the orthonormal vectors 
Yi,Y 2 £ p is given by 

K = H([y 2 ,Ki],[y 2 ,F 1 ]) = -||[Yi,y 2 ]|iS <o. 


Proof. As observed above ((5.5.5)), the Killing form is nondegenerate, and 
the symmetric space is semisimple. By Corollary 5.3.5 the sectional curvature 
of the plane spanned by Y \ , Y 2 £ p satisfies 

K — ( [ [hi , Y 2 ] , Y 2 ] , Y \ ) 

= -B([[ri,y 2 ],y 2 ],Yi) 

= -B([y 2 ,[y 2 ,yi]],Yi) 

= H([F 2 ,y 1 ],[F 2 ,y 1 ]), 


(5.5.10) 
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because the Killing form is AdG invariant. 

This expression is < 0, because by [p, p] C {, \Y%, Yi] £ t and B is negative 
definite on !. □ 

Definition 5.5.1 A subalgebra a of g is called abelian if [A\, A 2 ) = 0 for all 
A \ , A 2 £ a. 


We want to find the maximal abelian subspaces of p. Let a be an abelian 
subspace of p, i.e. an abelian subalgebra of jj that is contained in p. Thus 

AiA 2 — A 2 Ai = 0 for all Ax,A 2 € a. 

The elements of a therefore constitute a commuting family of symmetric 
(nx n ) matrices. Hence, they can be diagonalized simultaneously. Thus, there 
exists an orthonormal basis Vi, ... ,v n of R" consisting of common eigenvec- 
tors of the elements of a. We write our matrices w.r.t. an orthonormal basis 
ei, ... ,e n of R", and we choose S € SO(n) with 

S(vi) = ±e.j (i = 1, . . . ,n). 

SaS^ 1 then is an abelian subspace of p with eigenvectors ei, ... ,e n . Thus, 
all elements of S'a.S' -1 are diagonal matrices (with trace 0 since they are 
contained in p). This implies that the space of diagonal matrices of trace 0 is 
a maximal abelian subspace of p. Furthermore, it follows that each maximal 
abelian subspace is conjugate to this one, w.r.t. an element from K = SO(n). 
Therefore, any two maximal abelian subspaces of p are conjugate to each 
other. 

Let now a be an abelian subspace of p. We put 

A := exp a, 

where exp, as usual, is the exponential map g —* G. A then is a Lie subgroup 
of G. For (j \ , (J 2 € A, we have 


5 i 52 — g 2 ffi, 

because for any two commuting elements X,Y £ g 

e x+Y = e x e Y = e Y e x , 

as is easily seen from the exponential series. Thus, A is an abelian Lie group. 
On the other hand, because of a C p A also is a subspace of M = G/K. 

Lemma 5.5.3 A is totally geodesic in M and flat, i.e. its curvature vanishes. 

Proof. Let Y £ a. By definition of A , the geodesic e tY is contained in A. A is 
thus totally geodesic at the point eK — P in the sense that any geodesic of 
M through p and tangential to A at p is entirely contained in A. A operates 
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transitively and isometrically on itself by left translations. Let now q £ A. 
There then exists a £ A with ap = q. Since a as element of G is an isometry, 
it maps the geodesics of A and those of M through p onto geodesics through 
q. This implies that A is totally geodesic A1 at q as well, hence everywhere. 
The curvature formula (5.5.10) implies that A is flat. □ 

Let conversely N be a flat subspace of M. Since the Killing form of 1 is 
negative definite, the curvature formula (5.5.10) implies [Yi, Y 2 ] = 0 for all 
Yj , Y -2 £ T P N. Thus, T p N is an abelian subspace of p. 

We conclude 

Corollary 5.5.2 The maximal flat subspaces of M through p = eK 1 i.e. those 
not contained in any larger flat subspace of M, bijectively correspond to the 
maximal abelian subspaces of p. □ 

The assertions of Lemma 5.5.3 and Corollary 5.5.2 are valid for all sym- 
metric spaces. 

Definition 5.5.2 The rank of a symmetric space M is the dimension of a 
maximal flat subspace. 

Thus, the rank is the dimension of a maximal abelian subalgebra of g 
contained in p. As remarked above, any two such subalgebras are conjugate 
to each other. Likewise, because G operates transitively on M, the dimension 
of a maximal flat subspace through any given point of M is the same. 

Corollary 5.5.3 Rank (Sl(n, R)/SO(n)) = n — 1. 

Proof. As observed above, a maximal abelian subalgebra of g contained in p 
consists of the space of diagonal matrices with vanishing trace, and the latter 
space has dimension n — 1. □ 

Corollary 5.5.4 A symmetric space M of noncompact type has rank 1 if 
and only if its sectional curvature is negative. 

Proof. The rank is 1 if for two linearly independent Y\, Y 2 £ T p M, we have 
[Yi,Y 2 ] ^ 0. Since B is negative definite on £ and [Lj , Y 2 ] £ £ for Yi, Y 2 £ T p M 
(identified with p), (5.5.10) yields the claim. □ 

Lemma 5.5.4 For X £ t &dX : g — * g is skew symmetric w.r.t. (■, •) , and 
for X £ p, it is symmetric. 

Proof. Let X £t,Y,Z £t Then (ad X)Y = [ X , Y] <E £, hence 
([X,Y],Y) 0 = -B([X,Y],Z) = B(Y,[X,Z}) 

= ~(Y, [X,Z]) Q by (5.4.5) . 
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For A G 6 , Y S p, 2 € f, we have [X, Y ] G p, [X, Z] G 8 , hence 
(\X,Y\,Z) a =Q= (Y,[X,Z\) a . 

For X G 8 , Y, Z G p, we have [X, Y] G p, [X, 2] G p and 

{[X,Y\,Z ) 0 = B([X,Y],Z) = - B(Y,[X,Z ]) 

= -<Y,[A,Z ]} 0 by (5.4.5) . 

Altogether, this implies that ad A is skew symmetric for X G 8 . Let now 
A G p, Y, Z G 8 . Then [X, Y] G p, [X, 2] G p, hence 

([A,Y],Z) 0 =O=(Y,[A,Z ]) 0 

For A G p, Y G t,2 G p, we have [A, Y] G p, [A, Z) G 6 , hence 

<[A,Y],Z> fl = B([A,Y],Z) = - B(Y,[X,Z ]) by (5.4.5) = <Y,[A,Z]} 0 . 
Finally for A G p, Y, 2 G p, we have [A, Y] G 8 , [A, Z\ G 8 , hence 
([A, Y], Z) fl = 0 = (Y, [A, Z]) fl . 

Altogether, this implies that ad A is skew symmetric for A G p. □ 

Lemma 5.5.5 If A, Y G 0 commute, i.e. [A, Y] = 0, then so do ad A and 
adY. 

Proof. 


ad A ad YZ = [A, [Y,Z]} 

= — [Y, [. Z , A]] — [Z, [A, Y]] by the Jacobi identity 
= [Y, [A, Z]\ because of [A, Y] = 0 

= adYadAZ. □ 


Let now a be a fixed maximal abelian subspace of p. By Lemmas 5.5.4, 
5.5.5, for A G a, the maps ad A : 0 — > 0 are symmetric w.r.t. (•, •) and com- 
mute with each other. Therefore, 0 can be decomposed as a sum orthogonal 
w.r.t. (■,■) of common eigenvectors of the ad A, A G a : 

0 = So © X] 0Q ' 

aGA 


Definition 5.5.3 A is called the set of roots , and the a G A are called the 
roots of 0 w.r.t. a. 
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We have 

[X,Y] = (adX)Y = a(X)Y forl€a,Feg a . (5.5.11) 

Thus a(X) is the eigenvalue of adX on g a , with 0(X) := 0 for all X. Since 

0 is abelian, of course 

a C 0 O . 

Moreover, a : a — > R. is linear for all a € A, since 

ad(X + Y) = ad X + ad Y, ad(pX) = pad X for X, Y G a, ji G K. 

We now recall the involution 

a e : G - G, a e (h) = (/T 1 ) 4 . 
e := da e : 0 -> 0 , 0(X) = -X* 

which is also called Cartan involution , and the decomposition 

0 = t ® P, 

1 being the eigenspace of 9 with eigenvalue 1 , p the one with eigenvalue — 1 , 
is called Cartan decomposition. We thus may write 

(X,Y) b = -B(X,9Y). (5.5.12) 

In the same manner as e does, any element g of G, hence also any element 
gK of G/K induces a Cartan decomposition 0 = t' © p' with i' = Ad(<jr){ 
etc. (cf. also 5.3). 

Lemma 5.5.6 

(i) [0a, 0/3] c 0 a +/3 for a + (3 G A, [ 0 a, 0 / 3 ] = 0 for a + (3 i A. 

(ii) a € A —a G A, and for each a G A, 9 : 0 a — > 0- a is an 

isomorphism. 

(in) 9 leaves 0 o invariant, 0 o = 0 o H t + a. 

(iv) For lGa,y G 0 Q , Ad{e tx )Y = e^^Y. 

(v) For a. ^ -(3, B(g a , g 0 ) = 0. 

Proof. Let Y G 0 a , Z G gp, X G a. Then 

(ad X) [Y, Z ] = [X, [Y, Z\] = - [Y, [Z, X}} - [Z, [X, Y\\ 

because of the Jacobi identity 
= (3{X) [Y, Z\ + a(X) [Y, Z] = (a + 0) {X) [Y, Z ] . 


This implies (i). 
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Next 

[X, BY] = [X, — y 4 ] = -[X*, y 4 ] (X = X\ since X G a C p) 

= [X,y ] 4 = a(X)y* = -a(X)BY, 

hence BY G g_ a . This proves (ii), and the first part of (iii), too, hence also 
So = (oo ne) + (go n p). 

Since a is maximal abelian in p and commutes with all elements of go, it 
follows that go H p = n which is the remaining part of (iii) . 

Next 

OO J.TL 

Ad(e tJf ) = e tad * = Id + ^ ^y(adX)” 

n — 1 

which implies (iv). 

Finally, (v) follows from 

0= (0a,0/3 } 0 fora^/3 

= -B{Q a ,B(Q 0 )) by (5.5.11) 

= --B(0a,0-/3) by (ii) . □ 


We now want to determine the root space decomposition of g = si(n, R). 
For that purpose, let E 13 be as above, and 

H i ■- e u - E i+1 ’ i+1 , i = l,...,n-l. 

{E l3 (i 7 ^ j) and H k (k = 1, . . . , n — 1)} then constitute a basis of g. Let o be 
the space of diagonal matrices with vanishing trace, i.e. a maximal abelian 
subspace of p. 

n 

For X = diag(Ai, . . . , A„) = E \E™, we have 

i- 1 

(ad X)E ij = (A, ; - A j)E ij for i ± j 

(ad X)H l = 0 for i = 1, . . . , n — 1, since H l G o. 

We thus obtain n(n — 1) nonzero roots otifli j) with 

^ij 0 (^ di ag ( A i , . . . , A n )). 

The corresponding root spaces g aij are spanned by the E 13 . g 0 is spanned by 
H 1 , . . . , H n ~ l ] in particular 

0o = a- 

Definition 5.5.4 A maximal flat abelian subspace of G/K is called a flat. 
A geodesic in G/K is called regular if contained in one flat only; otherwise it 
is called singular. Tangent vectors of regular (singular) geodesics are called 
regular (singular). 
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Lemma 5.5.7 X £ a is singular iff there exists Y £ g\g 0 with [X, Y] = 0, 
i.e. if there exists a € A with a(X) = 0. 


Proof. Let X be singular. Then X is contained in another maximal abelian 
subspace a' of p besides a. Therefore, there exists Y £ a',Y ft. a. Because of 
X,Y £ a', 

[X,Y]= 0. 

Since g 0 n p = a (Lemma 5.5.6 (ii)), Y f g 0 . (5.5.11) implies a(X) = 0 for at 
least one a £ A. 

Let now a(X ) = 0 for such a a £ A. Let Y £ g a , F/0. Then 


[X, Y] = a(X)Y = 0. 

(5.5.13) 

We decompose 


Y = Y { + Y p with Y t £ tY p £ p. 

(5.5.14) 

For A £ u, we have because of Y £ g a 


[ A,Y\=a(A)Y , 

(5.5.15) 

and because of [6, p] C p, [p, p] C t, (5.5.14), (5.5.15) imply 


[A,Y t ] = a(A)Y p 

(5.5.16) 

[A,Y P ] = a(A)Y t . 

(5.5.17) 


If we had Y p = 0, then by (5.5.16) also Yj = 0, since a does not vanish on a, 
hence Y = 0. Likewise, Yf cannot vanish. By (5.5.17), Y p thus is contained 
in p\o. Since (5.5.13) - (5.5.17) imply 


[X,Y p ]=0, 


X and Yp are contained in some abelian, hence also in some maximal abelian 
subspace of p different from a. Thus, X is singular. □ 


By Lemma 5.5.7, the singular elements of a constitute the set 
u s i n g = {df £ u ; £ A : cx(X') = 0}. 

Using thus is the union of finitely many so called singular hyperplanes 
{X £ u : a(X) = 0} for a £ A. 

Likewise, the set of regular elements of u is 

a reg = {X £ u : \/a £ A : a(X) ± 0}. 

The singular hyperplanes partition u reg into finitely many components which 
are called Weyl chambers. 

n 

For g = sl(n, R), u = {diag(Ai, . . . , A„), E K = 0}, we have 
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casing {diag(Ai, . . . , A n ) : 3i ^ j : A ? ; — Xj , ^ ' Xi — 0} , 

i=l 

the space of those diagonal matrices whose entries are not all distinct. This 
follows from the fact that the roots are given by 

o^(diag(Ai, . . . , A„)) — A^ A j 


as computed above. 

One of the Weyl chambers then is 

a+ := {diag(Ai, . . . , A„) : Ai > A 2 > . . . > A„, EX :j = 0}. 

We call 

A + := {a £ A : MA £ a+ : a(A) > 0} 

the space of positive roots (this obviously depends on the choice of a + ). In 
our case, 

A + = {a i:j : i < j}. 

A := {ai 2 , a 23 , . . . , a n -i : n} C A + then is a fundamental system of positive 
roots, meaning that each a £ A + can be written as 

n— 1 

^ ^ 1 

i= 1 

with some .s, £ N. For abbreviation, we put a* := i = 1, . . . , n — 1. 

The sets 

{iea: > 0 for v = 1, . . . , r, 0 ^( 2 !) = 0 for v = r + 1, . . . , n — 1}, 

where {ii , . . . , i n -i} — {l,...,n — 1 }, then are the r'-dimensional “walls” 
of the Weyl chamber a + . The relation “is contained in the closure of” then 
defines an incidence relation on the space of all Weyl chambers and all Weyl 
chamber walls of all maximal abelian subspaces of p. This set with this inci- 
dence relation is an example of a so-called Tits building. Via the exponential 
map, we obtain a corresponding incidence structure on the set of all flats and 
all images of Weyl chamber walls through each given point of G/A'. 

We next introduce the Iwasawa decomposition of an element of Sl(n, R) = 
G. Let, as before, 

K = SO(n), 

and moreover 

n 

A := {diag(Ai, . . . , A„) : Aj > 0 for i = 1, . . . , n, JJ A, = 1} 

i—1 

N := {upper triangular matrices with entries 1 on the diagonal} 



286 5. Symmetric Spaces and Kahler Manifolds 


Theorem 5.5.2 (Iwasawa decomposition). We have 

G = KAN. 


More precisely, for each g £ G there exist unique k £ K,a £ A,n £ N with 

g = kan. 


We first prove 

Lemma 5.5.8 For each g £ Gl(n,R), there exists a unique h £ O(n) with 

( hg)ij = 0 for i < j 
( hg)u > 0. 


Proof. We denote the columns of g by tq, . . . , v n . The rows rq, . . . , r„ of h £ 
O(n) satisfying the assertions of the lemma must satisfy 

(i) rq, . . . , r„ is an orthonormal basis of R" (since h £ 0(?i)). 

(ii) r : j ■ Vi = 0 for i < j. (“•” here denotes the Euclidean scalar product). 

(iii) r : j ■ v j > 0 for all j. 

Conversely, if these three relations are satisfied, h has the desired prop- 
erties. 

We first determine r n by the conditions 

r n ■ r n = 1, r n ■ v n > 0, r n ■ Vj = 0 for i = 1, . . . , n - 1. 

Since the columns of g, i.e. the Vi, are linearly independent, there indeed exists 
such an r n . Assume now that we have iteratively determined rj, rj+\ , . . . , r n . 
Let Wj be the subspace of R" spanned by Vi, ... ,Vj-2,rj, ... ,r n . Wj then has 
codimension 1 because of the properties of the vectors rj, , r n . Then rj _ i 
has to be orthogonal to Wj and satisfies rj-i ■ Vj-i > 0 and r,_i • rj - 1 = 1. 
There exists a unique such Tj-\. Iteratively, we obtain n, . . . , r n , hence h. 

□ 

Proof of Theorem 5.5.2: By Lemma 5.5.8, there exist k £ SO(n), namely 
k = /i _1 from Lemma 5.5.8 (for g £ Sl(n,R), we get h. £ SO(ra)) and an 
upper triangular matrix m = ( ) with positive diagonal entries with 

g = km. 

We put A i := mu,nu = 1 ,nij = for i ± j, a = diag(Ai, . . . , A„), 

n = (jiij) and obtain 

g = km = kan. 

The uniqueness of this decomposition is implied by the uniqueness statement 
of Lemma 5.5.8. □ 
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5.6 Symmetric Spaces of Noncompact Type as 
Examples of Nonpositively Curved Riemannian 
Manifolds 

We continue to study the symmetric space M = Sl(n, R)/SO(n). It is com- 
plete (Corollary 5.3.1), nonpositively curved (Corollary 5.5.1), and simply 
connected (this follows from Lemma 5.5.2 since P is simply connected). Thus, 
the constructions at the end of § 4.8 may be applied to M. (Actually, what fol- 
lows will be valid for any symmetric space of noncompact type.) We continue 
to use the notations of 5.5, e.g. G = Sl(n,K),iv = SO(n). 

For x € M(oo), let 


G x ■= {g G G : gx = x} 

be the isotropy group of x. G x then is a subgroup of G. Let g^ be the corre- 
sponding sub Lie algebra of g. 

Theorem 5.6.1 Let x € M(oo),p € M, g = t (Bp be the Cartan decompo- 
sition w.r.t. p. Let X be the element of p = T p M with 

c px (t) = e tx {p) (= exp p tX) . 

Let a be a maximal abelian subspace of p with Ifo, and let 

0 = 0o + 

aeA 

be the root space decomposition of g determined by a. Then 

0* = 0o + (5.6.1) 

a(X)>0 


Corollary 5.6.1 Let be Weyl chambers or Weyl chamber walls with 

Bi C B 2 . Let Xi € B i,X 2 € B 2 ||A' 1 || = ||X 2 || = 1, # 1 , 2:2 € M( 00 ) be the 
classes of asymptotic geodesic rays determined by X\ and X 2 , resp. Then 

G X2 C G X1 . (5.6.2) 

Conversely, G X2 C G Xl implies B\ c B 2 . 

Proof of Corollary 5.6.1: B\ and B are contained in a common maximal 
abelian subspace 0 of p. Let A be the set of roots of the root space decom- 
position of g determined by n. Each a & A which is nonnegative on B 2 then 
is nonnegative on B\, too. Theorem 5.6.1 then implies the claim. □ 
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By Corollary 5.6.1, the geometric relation Bx C B 2 defining the incidence 
relation for the Tits building may be replaced by the algebraic relation (5.6.2) 
between subgroups of G. 

Proof of Theorem 5.6.1: For abbreviation, we put 

c(t) . Cp X {t) . 

Let Y G g. We decompose 

Y = Y 0 +yY a with Y 0 G 00, Y a G g a 

a&A 

and put 

Y(t) := Acl(e _tA ')F = Y 0 + ^ e~ ta(x) Y a by Lemma 5.5.6 (iv). (5.6.3) 

Qg/l 

Then for all s, t G JR. 

d(e sY c{t),c{t)) = d(e sY e tx (p),e tx (p)) 

= d{e~ tx e sY e tx (p),p), since e tx is an isometry of M 
= d(Ad(e~ tx )e sY (p),p) (5.6.4) 

= d(e sY ^(p),p). 

Let now 

Y G go + 0 q. 

a(Jf)>0 

We put 

Y':=Y 0 + Y °- 

a{X)=0 

(5.6.3), (5.6.4) imply for each s 

lim d 2 (e sY c(t) , c(t)) = d 2 {e sY (jp),p). 

t—> OO 

Since by Theorem 4.8.2, d 2 (e sY c(t) , c(t)) is convex in t, it has to be bounded 
for t > 0. Hence e sY c is asymptotic to c, hence 

e sY G G x for all s, 

hence 

L £ 01- 

Let conversely F G 0j. We write Y = Y[+Y 2 with Lj :=Yq+ fP Y a ,Y 2 := 

a(X)>0 

fP Y a . By what we have just proved, we obtain 

a(JC)<0 

Yx G 0 a ,, hence also Y 2 = Y — Yx € q x . 
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Therefore, for any fixed s, 

d 2 (e s ^ 2 c(t),c(t)) 

is bounded for t > 0. On the other hand (5.6.3), (5.6.4) imply 

lim d 2 (e sY c(t) , c(t)) = 0. 

t — > — oo 

Since this function is convex by Theorem 4.8.2, it then vanishes identically. 
We obtain 

e sY2 c(t) = c(f) 


hence in particular 


e s ' L2 p = p, 


hence Y 2 £ f. Therefore, letting 0 p denote the Cartan involution at p , 
Y 2 = 0 p (Y 2 ) £0 p ( ^2 0a) since 9 p \ t = id| { , 

a(X)<0 

= ^2 0 Q by Lemma 5.5.6(h). 

a(X)>0 

By definition of Y 2 , this implies dim Y 2 = 0, hence 

I = Y\ £ go + ^2 0a- 

a(X)>0 


□ 


Remark. The isotropy groups of any two points p,q £ M are conjugate. If 
q = gp , then 

G q = gG p g _1 . 

(The isotropy group of p £ M is by definition G p = {g £ G : gp = p}.) 

The isotropy groups of points in M( oo), however, are not necessarily 
conjugate as one sees from Theorem 5.6.1. However, there are only finitely 
many conjugacy classes. 


Example. Let 


X = diag(Ai,...,A„) 

and let x be the element in M( oo) determined by X. 
Then 


0X = {A = £ s((n,M) with a = 0 for A, ; < Ay}. 

For example, if 

Ai ••• ^ A n > 

then 0j. is the space of upper triangular matrices. 
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Perspectives. For a differential geometric treatment of symmetric spaces of non- 
compact type, our sources and references are [63, 64], [12]. 

Let G jK be a symmetric space of noncompact type. A discrete subgroup F of 
G is called a lattice if the quotient F\G/K has finite volume in the induced locally 
symmetric metric. Here, F operates on G/K by isometries since the whole group 
G does, r may have fixed points so that the quotient need not be a manifold. Any 
such r, however, always contains a subgroup F' of finite index which is torsion 
free, i.e. operates without fixed points (i.e. there do not exist 7 £ F' ,7 id, and 
2 € G/K with 7 z = z), and the quotient F\G/I\ then is a manifold and a finite 
covering of F\G/K. Therefore, one may usually assume w.l.o.g. that F itself has no 
fixed points, and we are hence going to do this for simplicity of discussion. A lattice 
r is called uniform or cocompact if the quotient is compact, nonuniform otherwise. 

We now discuss the rigidity of such lattices. 

For G = S1(2,R) ~ SOo(2, 1) and K = SO(2), there exist continuous families 
of compact quotients, namely Riemann surfaces of a given genus p > 2. Thus, no 
rigidity result holds in this case. This, however, is a singular phenomenon. 

The first rigidity result was obtained by Calabi and Vesentini[36] who showed 
that compact quotients of any irreducible Hermitian symmetric space of noncom- 
pact type other than Sl( 2 , R)/SO(2) are infinitesimally, hence locally, rigid. They 
showed that the relevant cohomology group rising from the theory of Kodaira and 
Spencer vanishes in all these cases. Their result means that there do not exist non- 
trivial continuous families of uniform lattices in G/K other than Sl(2,R)/SO(2). 

Mostow[190] showed strong rigidity of compact quotients of irreducible symmet- 
ric spaces of noncompact type. This means that any two such lattices F, F' which 
are isomorphic as abstract groups are lattices in the same G and isomorphic as 
subgroups of G. Geometrically this means that the quotients F\G/K and F'\G/K 
are isometric. (Here, as always, they carry the Riemannian metric induced from the 
symmetric metric on G/K.) 

Margulis[173] then showed superrigidity if rank (G/K) > 2. This essentially 
means that any homomorphism p : F — > H (F as above) extends to a homomorphism 
G — > H, if H, like G, is a simple noncompact algebraic group (defined over R) and 
if p(F) is Zariski dense, or that p(F) is contained in a compact subgroup of H, 
if H is an algebraic subgroup of some Sl(n, Q p ). Here, Q p stands for the p-adic 
numbers. More generally and precisely, if G is a semisimple Lie group without 
compact factors with maximal compact subgroup K, rank (G/K) > 2, if F is an 
irreducible lattice in G (irreducibility means that no finite cover of the quotient 
F\G/K is a nontrivial product; this condition is nontrivial only in the case where 
G/K itself is not irreducible, i.e. a nontrivial product), and if H is a reductive 
algebraic group over R,C, or some Q p , then any homomorphism p : F — > H with 
Zariski dense image (this means that p(F) is not contained in a proper algebraic 


r -a GxL 

a 

H 

subgroup of H) factors through a homomorphism where L is a compact group. The 
results of Margulis and their proofs can be found in [266]. Important generalizations 
are given in [174]. Margulis also showed that superrigidity implies arithmeticity of a 
lattice F. This means that r is obtained from the prototype Sl(n, R) by certain finite 
algebraic operations, namely taking the intersection of SI (n,Z) with Lie subgroups 
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of Sl(n, R), applying surjective homomorphisms between Lie groups with compact 
kernels, passing to sublattices of finite index or taking finite extensions of lattices. 

In the Perspectives on 8.10, we shall discuss how harmonic maps can be used 
to prove superrigidity. 


Exercises for Chapter 5 

1) Show that real projective space RP" (cf. exercise 3 of Chapter 1) can 
be obtained as the space of all (real) lines in R" +1 . Show that RP 1 
is diffeomorphic to S 1 . Compute the cohomology of RP". Show that 
RP" carries the structure of a symmetric space. 

2) Similarly, define and discuss quaternionic projective space HP" as 
the space of all quaternionic lines in quaternionic space H” +1 . In 
particular, show that it is a symmetric space. 

3) Determine all Killing fields on S n . 

4) Determine the Killing forms of the groups Sl(n, C) , Sp(n, R) , SU (n) , U (n ) . 

5) Discuss the geometry of S n by viewing it as the symmetric space 
SO(n + l)/SO(n). 

6) Show that CP" = SU(?i+ 1)/S(U(1) x U(n)). Compute the rank of 
CP" as a symmetric space. 

7) Determine the closed geodesics and compute the injectivity radius of 
the symmetric space RP" (cf. 1)). 
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6.1 Preliminaries: Aims of Morse Theory 

Let X be a complete Riemannian manifold, not necessarily of finite dimension 
1 . We shall consider a smooth function / on X, i.e. / £ C°°(X, R) (actually 
f G C 3 (X, R) usually suffices) . The essential feature of the theory of Morse 
and its generalizations is the relationship between the structure of the critical 
set of /, 

C(f) := {xGX: df(x) = 0} 

(and the space of trajectories for the gradient flow of /) and the topology of 
X. 

While some such relations can already be deduced for continuous, not nec- 
essarily smooth functions, certain deeper structures and more complete re- 
sults only emerge if additional conditions are imposed onto f besides smooth- 
ness. Morse theory already yields very interesting results for functions on 
finite dimensional, compact Riemannian manifolds. However, it also applies 
in many infinite dimensional situations. For example, it can be used to show 
the existence of closed geodesics on compact Riemannian manifolds M by 
applying it to the energy functional on the space X of curves of Sobolev class 
H 1 ’ 2 in M, as we shall see in § 6.11 below. 

Let us first informally discuss the main features and concepts of the theory 
at some simple example. We consider a compact Riemannian manifold X 
diffeomorphic to the 2-sphere S 2 , and we study smooth functions on X; 
more specifically let us look at two functions /i , /2 whose level set graphs are 
exhibited in the following figure, 

^ In this textbook, we do not systematically discuss infinite dimensional Riemannian mani- 
folds. The essential point is that they are modeled on Hilbert instead of Euclidean spaces. At 
certain places, the constructions require a little more care than in the finite dimensional case, 
because compactness arguments are no longer available. 
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with the vertical axis describing the value of the functions. The idea of Morse 
theory is to extract information about the global topology of X from the 
critical points of /, i.e. those p £ X with 

df{p) = 0. 

Clearly, their number is not invariant; for fi, we have two critical points, 
for / 2 , four, as indicated in the figure. In order to describe the local ge- 
ometry of the function more closely in the vicinity of a critical point, we 
assign a so-called Morse index p(p) to each critical point p as the number 
of linearly independent directions on which the second derivative d 2 f(p) is 
negative definite (this requires the assumption that that second derivative is 
nondegenerate, i.e. does not have the eigenvalue 0, at all critical points; if 
this assumption is satisfied we speak of a Morse function) . Equivalently, this 
is the dimension of the unstable manifold W u (p). That unstable manifold is 
defined as follows: We look at the negative gradient flow of /, i.e. we consider 
the solutions of 


x : K -> M 

x(t) = — grad f(x(t)) for all t £ R. 

It is at this point that the Riemannian metric of A' enters, namely by defining 
the gradient of / as the vector field dual to the 1-form df. The flow lines 
x(t) are curves of steepest descent for /. For t — > ±oo, each flow line x(t) 
converges to some critical points p = x(— oo), q = x(oo) of /, recalling that in 
our examples we are working on a compact manifold. The unstable manifold 
W u (p) of a critical point p then simply consists of all flow lines x(t) with 
x(— oo) = p, i.e. of those flow lines that emanate from p. 
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In our examples, we have for the Morse indices of the critical points of /i 


and for /2 


Wife) = 2 i Wife) = 0, 


Wafe) = 2 > Wafe) = 2 > Wafe) = 1 i Wafe) = 0 i 

as /i has a maximum point pi and a minimum j>2 as its only critical points 
whereas fi has two local maxima pi,p2, a saddle point P3 , and a minimum p^ . 
As we see from the examples, the unstable manifold W u (p) is topologically a 
cell (i.e. homeomorphic to an open ball) of dimension p{p), and the manifold 
X is the union of the unstable manifolds of the critical points of the function. 
Thus, we get a decomposition of X into cells. In order to see the local effects 
of critical points, we can intersect W u (p) with a small ball around p and 
contract the boundary of that intersection to a point. We then obtain a 




pointed sphere (S^ p \ pt.) of dimension p(p). These local constructions al- 
ready yield an important topological invariant, namely the Euler character- 
istic xfe), as the alternating sum of these dimensions, 
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xw = E (-1 r (p W)- 

p crit. pt. of / 

We are introducing the signs (— 1) M ^ here in order to get some cancellations 
between the contributions from the individual critical points. This issue is 
handled in more generality by the introduction of the boundary operator d. 
From the point of view explored by Floer, we consider pairs ( p , q) of critical 
points with p(q) = p(p) — 1, i.e. of index difference 1. We then count the num- 
ber of trajectories from p to q modulo 2 (or, more generally, with associated 
signs as will be discussed later in this chapter): 

dp = E (#{fl° w lines from p to q} mod 2 )q. 

q crit. pt of / 

M(<3) = m(p) — 1 

In this way, we get an operator from C*(/, Z 2 ), the vector space over Z 2 
generated by the critical points of /, to itself. The important point then is 
to show that 

d o d = 0. 

On this basis, one can define the homology groups 

Hk(X,f, Z 2 ) := kernel of d on Cfc(/,Z 2 ) /image of d from 

C k+ i(f, Z 2 ), 

where Ck{f , Z 2 ) is generated by the critical points of Morse index k. (Because 
of the relation dod = 0, the image of d from Cfc+i(/,Z 2 ) is always contained 
in the kernel of d on Ck{f, Z 2 ).) We return to our examples: In the figure, we 
now only indicate flow lines between critical points of index difference 1. 




For fi , there are no pairs of critical points of index difference 1 at all. Denoting 
the restriction of d to Ck(f, Z 2 ) by dk, we then have 
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ker d 2 = {pi} 
ker<9 0 = {p 0 }, 

while d\ is the trivial operator as Ci(fi,Z 2 ) is 0. All images are likewise 
trivial, and so 

H 2 (X, fi, Z 2 ) = Z 2 
H 1 (X,f 1 ,Z 2 )=0 
Ho(X, fi, Z 2 ) = Z 2 

Putting 

bk := dimz 2 Hk(X, /, Z 2 ) (Betti numbers), 
in particular we recover the Euler characteristic as 

x(x) = £(- 1 )% 

3 

Let us now look at / 2 . Here we have 

d 2 Pi = 9 2 p2 = P 3 , hence d 2 (pi + P 2 ) = 2p 3 = 0 
diP 3 = 2p 4 = 0 (since we are computing mod 2) 
d 0 p 4 = 0 


Thus 

H 2 (X, / 2 , Z 2 ) = ker d 2 = Z 2 
Hi(X, / 2 ,Z 2 ) = ker<9i/image<9 2 = 0 
H 0 (XJ 2 ,Z 2 ) = ker a 0 /image5i = Z 2 . 

Thus, the homology groups, and therefore also the Betti numbers are the 
same for either function. This is the basic fact of Morse theory, and we also see 
that this equality arises from cancellations between critical points achieved 
by the boundary operator. 

This will be made more rigorous in §§ 6.3 - 6.10. 

As already mentioned, there is one other aspect to Morse theory, namely 
that it is not restricted to finite dimensional manifolds. While some of the 
considerations in this Chapter will apply in a general setting, here we can 
only present an application that does not need elaborate features of Morse 
theory but only an existence result for unstable critical points in an infinite 
dimensional setting. This will be prepared in § 6.2 and carried out in § 6.11. 
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6.2 Compactness: The Palais-Smale Condition and the 
Existence of Saddle Points 


On a compact manifold, any continuous function assumes its minimum. It 
may have more than one local minimum, however. If a differentiable func- 
tion on a compact manifold has two local minima, then it also has another 
critical point which is not a strict local minimum. These rather elementary 
results, however, in general cease to hold on noncompact spaces, for example 
infinite dimensional ones. The attempt to isolate conditions that permit an 
extension of these results to general, not necessarily compact situations is 
the starting point of the modern calculus of variations. For the existence of a 
minimum, one usually imposes certain generalized convexity conditions while 
for the existence of other critical points, one needs the so-called Palais-Smale 
condition. 

Definition 6.2.1 f £ C 1 (X, K) satisfies condition (PS) if every sequence 
(z n )neN with 

(i) \f(x n )\ bounded 

(ii) \\df(x n )\\ -> 0 for n — *■ oo 
contains a convergent subsequence. 

Obviously, (PS) is automatically satisfied if X is compact. It is also sat- 
isfied if / is proper, i.e. if for every c£t 

{x£X : \f(x)\ < c} 

is compact. However, (PS) is more general than that and we shall see in the 
sequel (see § 6.11 below) that it holds for example for the energy functional 
on the space of closed curves of Sobolev class H 1,2 on a compact Riemannian 
manifold M. 

For the sake of illustration, we shall now demonstrate the following result: 

Proposition 6.2.1 Suppose f € C 1 (X,R) satisfies (PS) and has two strict 
relative minima X\,x 2 G X. Then there exists another critical point £3 of f 
(i.e. df(x 3) = 0) with 

f(x 3) = k:= inf max/(cc) > max{/(£i), f(x 2 )} (6.2.1) 

7 gr 2:67 

with r := {7 € C°([0,1],X) : 7(0) = £1,7(1) = x 2 }, the set of all paths 
connecting X\ and x 2 . (£3 is called a saddle point for /.) 
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We assume also that solutions of the negative gradient flow of /, 

J^(z> t) = -grad r, f)) (6.2.2) 

< 7 ?(x, 0) = a; 

exist for all x G X and 0 < t < s, for some £ > 0. (grad / is the gradient 
of /, see (2.1.13); it is the vector field dual to the 1-form df.) 

Proof. Since x\ and X 2 are strict relative minima of /, 

3<5o > 0V<5 with 0<(5<(5o3£>OVa; with ||ar — Xi\\ = S : 

f(x) > f(xi) + e for i = 1, 2. 


Consequently 

3e 0 > OVy G T3r G (0,1) : /(q(r)) > max(/(si), f(x 2 )) +e 0 . 

This implies 

k > max(f(xi),f(x2))- (6.2.3) 

We want to show that 


r := {XGR": /(*) = k} 


contains a point X3 with 


rf/fe) = 0. 

If this is not the case, by (PS) there exist 77 > 0 and a > 0 with 

\\df(x)\\ > a 


(6.2.4) 

(6.2.5) 


whenever k — rj < /(x) < k + r]. 

Namely, otherwise, we find a sequence (x n )„gN C X with f(x n ) — » k and 
df(x n ) — > 0 as n — > 00, hence by (PS) a limit point X3 that satisfies /(x’3) = 
k, df(x 3) = 0 as / is of class C 1 . 

In particular, 

f{x 1 ),f(x 2 )<K-ri, (6.2.6) 

since df(x\) = 0 = df(x 2). Consequently we may find arbitrarily small rj > 0 
such that for all 7 G T with max/(7(r)) < k + 77 : 


Vr G [0, 1] : either /(7(r)) < k — 77 

or ||d/(7(r))|| > a. (6.2.7) 

We let </?(x,t) be the solution of (6.2.2) for 0 < t < e. 

We select ij > 0 satisfying (6.2.7) and 7 G T with 
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Then 


max f('y(r)) < k + rj. (6.2.8) 

tG[0,1] 


= -\\dfM 1 (r),t))\\ 2 <0. (6.2.9) 

Therefore 

max t)) < max < k + tj. (6.2.10) 

Since grad/(a’i) = 0,i = 1,2, because x±,X 2 are critical points of /, also 
ip{xi, t ) = Xi for i = 1,2 and all t G K, hence 

¥>('1 (').*) e r - 

(6.2.9), (6.2.6), (6.2.7), and (6.2.2) then imply 
d a 2 

"77 /(^(70)^)) < whenever /(<p(7(r),t) > k - ry. (6.2.11) 

dt 4 

We may assume that the above 77 > 0 satisfies 


Then the negative gradient flow exists at least up to t = -j*. (6.2.10) and 
(6.2.11), however, imply that for to = we have 

/(V j (7(r),i 0 )) < « - V for all r G [0, 1]. 

Since <p(7(-),^o) G P, this contradicts the definition of k. We conclude that 
there has to exist some X3 with f(xz) = k and df(x 3) = 0. □ 

The issue of the existence of the negative gradient flow for / will be 
discussed in the next §. Essentially the same argument as in the proof of 
Prop. 6.2.1 will be presented once more in Thm. 6.11.3 below. 


Perspectives. The role of the Palais-Smale condition in the calculus of variations 
is treated in [142]. A thorough treatment of many further examples can be found in 
[234] and [39] . A recent work on Morse homology in an infinite dimensional context 
is Abbondandolo, Majer[l[. 
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6.3 Local Analysis: Nondegeneracy of Critical Points, 
Morse Lemma, Stable and Unstable Manifolds 


The next condition provides a nontrivial restriction already on compact man- 
ifolds. 


Definition 6.3.1 / G C 2 (X, R) is called a Morse function if for every xq € 
C(f), the Hessian d? f{x o) is nondegenerate. (This means that the continuous 
linear operator 

A : T X0 X -> T* q X 


defined by 


(H„)(r>) = d 2 f(x 0 )(u, v) for u , v G T Xo X 


is bijective.) Moreover, we let 


V~ C T X0 X 

be the subspace spanned by eigenvectors of (the bounded, symmetric, bilinear 
form) d 2 f(x o) with negative eigenvalues and call 

/i(®o) := dim V~ 

the Morse index of x 0 G C(f). For k G N, we let 

C k (f) := {z G C(f) : fi(x) = k} 
be the set of critical points of / of Morse index k. 

The Morse index n{x$) may be infinite. In fact, however, for Morse theory 
in the sense of Floer one only needs finite relative Morse indices. Before we can 
explain what this means we need to define the stable and unstable manifolds 
of the negative gradient flow of / at xq- 

The first point to observe here is that the preceding notion of nondegen- 
eracy of a critical point does not depend on the choice of coordinates. Indeed, 
if we change coordinates via 

x = £(y), for some local diffeomorphism £, 

then, computing derivatives now w.r.t. y, and putting yo = 

d 2 (fo Q(yo){u,v) = (d 2 f)(t(y 0 ))(d(,(y 0 )u,d£(yo)v) for any u,v, 


if 


df{x 0 ) = 0. 


Since d£(yo) is an isomorphism by assumption, we see that 


d 2 (f°0(yo) 
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has the same index as 

d 2 f(x o). 

The negative gradient flow for / is defined as the solution of 


<j>:XxR^X 


= - grad f(<f>(x,t)) (6.3.1) 

4>(x, 0) = x. 

Here, grad / of course is the gradient of / for all x £ X, defined with the 
help of some Riemannian metric on X, see (2.1.13). 

The theorem of Picard-Lindelof yields the local existence of this flow (see 
Lemma 1.6.1), i.e. for every x £ X, there exists some e > 0 such that 4>(x,t) 
exists for —e<t<e. This holds because we assume f £ C 2 (X, R) so that 
grad / satisfies a local Lipschitz condition as required for the Picard-Lindelof 
theorem. We shall assume in the sequel that this flow exists globally, i.e. that 
4> is defined on all of X xt. In order to assure this, we might for example 
assume that d 2 f(x) has uniformly bounded norm on X. 

(6.3.1) is an example of a flow of the type 

cj>-.Xx R-+X 

^4> = V(<j>(x,t)), </>(x,0)=x 

for some vector field V on X which we assume bounded for the present 
exposition as discussed in 1.6. The preceding system is autonomous in the 
sense that V does not depend explicitly on the “time” parameter t (only 
implicitly through its dependence on 0). Therefore, the flow satisfies the 
group property 

(j)(x,ti +< 2 ) = </>(</>(£, H), £ 2 ) for all ti,t 2 £ R (see Thm. 1.6.1). 

In particular, for every x £ X, the flow line or orbit -y x := {(f>{x ,t) : t £ R} 
through x is flow invariant in the sense that for y £ j x , t £ R 

£ lx- 

Also, for every t £ R, t) : X — > X is a diffeomorphism of X onto its image 
(see Theorem 1.6.1). 

As a preparation for our treatment of Morse theory, in the present section 
we shall perform a local analysis of the flow (6.3.1) near a critical point Xq 
of /, i.e. grad f(x 0 ) = 0. 

Definition 6.3.2 The stable and unstable manifolds at Xq of the flow cj) are 
defined as 
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W s (x 0 ) 


£ X : lim (f>(y, t) = Xq 

t — >-+oo 


W u (x o) 


\yeX: lim <j)(y, t) 

I t — > — OO 



Of course, the question arises whether W s (x 0 ) and W u (x 0 ) are indeed 
manifolds. 

In order to understand the stable and unstable manifolds of a critical 
point, it is useful to transform / locally near a critical point Xo into some 
simpler, so-called “normal” form, by comparing / with a local diffeomor- 
phism. Namely, we want to find a local diffeomorphism 

x = €(y), 


with 


xq = £(0) for simplicity 


such that 

/(?(j/)) = fix o) + ^ d 2 f(x 0 )(y , y). (6.3.2) 

In other words, we want to transform / into a quadratic polynomial. Having 
achieved this, we may then study the negative gradient flow in those coordi- 
nates w.r.t. the Euclidean metric. It turns out that the qualitative behaviour 
of this flow in the vicinity of 0 is the same as the one of the original flow in 
the vicinity of xo = £(0) . 

That such a local transformation is possible is the content of the Morse- 
Palais-Lemma: 


Lemma 6.3.1 Let B be a Banach space, U an open neighborhood of Xq £ B, 
f £ C k+2 (U,M.) for some k > 1, with a nondegenerate critical point at xq. 
Then there exist a neighborhood V of 0 £ B and a diffeomorphism 

f(V) C U 

of class C k with £(0) = Xq satisfying (6.3.2) in V . In particular, nondegen- 
erate critical points of a function f of class C 3 are isolated. 

Proof. We may assume Xo = 0, /( 0) = 0 for simplicity of notation. 

We want to find a flow 


ip : V x [0, 1] -► B 


<p(y, 0 ) = y 

1)) = ^d 2 f(0)(y, y) for all y £ V. 


with 


(6.3.3) 

(6.3.4) 



304 6. Morse Theory and Floer Homology 


£(y) := ip(y, 1) then has the required property. We shall construct <p(y,t) so 
that with 

V(y,i) ■= tf(y ) + ^(i - t)d 2 f(0)(y,y), 

we have 

T£l{v{y,t),t) = 0, (6.3.5) 

implying 

/(^(y. !)) = 1), 1) = r){<p(y, 0), 0) = ^d 2 /(0)(y, y) 

as required. (6.3.5) means 

0 = f{<p{y, t)) + t df(<p(y, t))^ip(y, t) (6.3.6) 

-^d 2 f(0)(<p(y,t),<p(y,t)) + (1 - t)d 2 f(0)(<p(y,t), ^(f{ y,t )). 
Now by Taylor expansion, using df{ 0) = 0, 

f(x) = [ (1 - r)d 2 f(rx)(x , x) dr 

Jo 

df(x) = f d 2 f(rx)x dr. 

Jo 

Inserting this into (6.3.6), with x = ip(y,t), we observe that we have a com- 
mon factor i p(y,t) in all terms. Thus, abbreviating 

T 0 (x) := -^d 2 /(0) + J (1 - r)d 2 f(rx) dr 

Ti(x, t) := d 2 f( 0) + t [ (d 2 f(rx) - d 2 f( 0)) dr, 

Jo 

(6.3.6) would follow from 

0 = T 0 (<p(y,t))ip(y,t) +Ti(<p(y,t),t)^ip(y,t). (6.3.7) 

Here, we have deleted the common factor <p(y,t), meaning that we now con- 
sider e.g. d 2 /( 0) as a linear operator on B. 

Since we assume that d 2 /( 0) is nondegenerate, d 2 f( 0) is invertible as a 
linear operator, and so then is T\(x,t) for x in some neighborhood IT of 0 
and all t € [0, 1]. 

Therefore, 

-Ti(v>(y, ° T o (<p(y, t))<p(y, t ) 

exists and is bounded if <p(y,t) stays in IT. Therefore, a solution of (6.3.7), 
i.e. of 
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^P{y,t) = -T^wiVit) x o T 0 (<p(y,t))ip(y,t), (6.3.8) 

stays in W for all t £ [0,1] if <p(y, 0) is contained in some possibly smaller 
neighborhood V of 0. The existence of such a solution then is a consequence 
of the theorem of Picard-Lindelof for ODEs in Banach spaces. This completes 
the proof. □ 

Remark. The preceding lemma plays a fundamental role in the classical ex- 
positions of Morse theory. The reason is that it allows to describe the change 
of topology in the vicinity of a critical point xq of / of the sublevel sets 

f x := {yeX: f{y) < A} 

as A decreases from f(x o) + e to f(x o) — £ , for £ > 0. 

The gradient flow w.r.t. the Euclidean metric for / of the form (6.3.2) 
now is very easy to describe. Assuming w.l.o.g. f(x o) = 0, we are thus in the 
situation of 

9{v) = 2 B (y>y)’ 

where £?(•,•) is a bounded symmetric quadratic form on a Hilbert space H. 
Denoting the scalar product on H by (•,•), B corresponds to a selfadjoint 
bounded linear operator 

L: H -> H 


via 

( L(u),v ) = B(u,v) 

by the Riesz representation theorem, and the negative gradient flow for g 
then is the solution of 

-Q t = - L(j>{y,t ) 

o) = y- 

If v is an eigenvector of L with eigenvalue A, then 

(j)(v, t) = e~ xt v. 

Thus, the flow exponentially contracts the directions corresponding to pos- 
itive eigenvalues, and these are thus stable directions, while the ones corre- 
sponding to negative eigenvalues are expanded, hence unstable. 

Let us describe the possible geometric pictures in two dimensions. If we 
have one positive and one negative eigenvalue, we have a so-called saddle, 
and the flow lines in the vicinity of our critical point look like: 
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Fig. 6.3.1. The horizontal axis is the unstable, the vertical one the stable 
manifold. 

If we have two negative eigenvalues, hence two unstable directions, we have 
a node. If the two eigenvalues are equal, all directions are expanded at the 
same speed, and the local picture is 



Fig. 6.3.2. 

If they are different, we may get the following picture, if the one of largest 
absolute value corresponds to the horizontal direction 



Fig. 6.3.3. 
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The situations of Figures 6.3.2 and 6.3.3 are topologically conjugate, but 
not differentiably. However, if we want to preserve conditions involving deriva- 
tives like the transversality condition imposed in the next section, we may 
only perform differentiable transformations of the local picture. It turns out 
that the situation of Figure 6.3.1 is better behaved in that sense. 

Namely, the main point of the remainder of this section is to show that the 
decomposition into stable and unstable manifolds always has the same quali- 
tative features in the differentiable sense as in our model situation of a linear 
system of ODEs (although the situation for a general system is conjugate to 
the one for the linearized one only in the topological sense, as stated by the 
Hartmann-Grobman-Theorem) . All these results will depend crucially on the 
nondegeneracy condition near a critical point, and the analysis definitely be- 
comes much more complicated without such a condition. In particular, even 
the qualitative topological features may then cease to be stable against small 
perturbations. While many aspects can still be successfully addressed in the 
context of the theory of Conley, we shall confine ourselves to the nondegen- 
erate case. 

By Taylor expansion, the general case may locally be considered as a 
perturbation of the linear equation just considered. Namely, we study 

t) = —L<j)(y, t) + rj((j>(y, t)) (6.3.10) 

4>{y,o) = V > 

in some neighborhood U of 0, where 77 : H — > H satisfies 

77 ( 0 ) = 0 

h(x) - v(y ) II < ^(e)lk - y\\ (6.3.11) 

for ||x||, \\y\\ < e, with S(e) a continuous monotonically increasing function of 
£ € [0, 00 ) with 5(0) = 0. The local unstable and stable manifolds of 0 then 
are defined as 

W u ( 0, U) = |x € U : 4>(x, t) exists and is contained in U for all t < 0, 
lim (/>(;r, f) = 0 } 

1T S (0, U) = ja: € U : (j>(x, t ) exists and is contained in U for all t > 0, 

lim <fi(x,t)=o\. 
t — >-+00 J 

We assume that the bounded linear selfadjoint operator L is nondegener- 
ate, i.e. that 0 is not contained in the spectrum of L. As L is selfadjoint, the 
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spectrum is real. H then is the orthogonal sum of subspaces H +1 H_ in- 
variant under L for which L\ H+ has positive, L\ H _ negative spectrum, and 
corresponding projections 

P±:H->H±, P+ + P_ = Id. 


Since L is bounded, we may find constants co ,7 > 0 such that 
||e _it P+|| < Coe -7 * for t > 0 


\e~ Lt P-\\ < c 0 e 7t for t <0. 


(6.3.12) 


Let now y(t) = <j>(x,t) be a solution of (6.3.10) for t > 0. We have for any 
t e [o, oo ) 

t 

y(t) = e~ L{t ^ T) y(T) + J e~ L{t ~ s '>y(y(s))ds, (6.3.13) 


hence also 


l 

P±y(t ) = e~ L(t ~ r) P ± y(r) + j e~ L(t ~ s) P±y{y{s)) ds. (6.3.14)± 

r 

If we assume that y(t) is bounded for t > 0, then by (6.3.12) 

lim e“ i(t " T) P_ 2 /(r) = 0, (6.3.15) 

T— >■ OO 

and hence such a solution y(t) that is bounded for t > 0 can be represented 


as 


y{t) = P+y(t) + P-y(t) 


z 

= e -"r +I + / e -«-)p + ,«,))< i. 


0 

oo 


(6.3.16) 


1 6 L< ' t ^ with x = y(0) 


(putting t = 0 in (6.3.14) + , r = oo in (6.3.14)_). Conversely, any solution of 
(6.3.16), bounded for t > 0, satisfies (6.3.13), hence (6.3.10). For a solution 
that is bounded for t < 0, we analogously get the representation 


u t 

y(t) = e~ Lt P_x — J e~ L( - t ~ s ^P-r](y(s)) ds + J e ~ L ^ t ~ s ' > P + rj{y{s)) ds. 


Theorem 6.3.1 Let <j>(y,t) satisfy (6.3.10), with a bounded linear nondegen- 
erate selfadjoint operator L and rj satisfying (6.3.11). Then we may find a 
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neighborhood U of 0 such that W s (0, U) (W u { 0, U)) is a Lipschitz graph over 
P+H D U (P-H nu), tangent to P+H (P_H ) at 0. If y is of class C k in U, 
so are W s (0,U) and W u (0,U). 


Proof. We consider, for x € P+H, 

t OO 

T(y,x)(t) :=e~ Lt x + J e~ L ^~ s ^ P+y{y{s)) ds — J e~ L ^~ s ^ P-T](y(s)) ds. 

o t 

(6.3.17) 

From (6.3.16) we see that we need to find fixed points of T, i.e. 

y(t) = T (y,x)(t). (6.3.18) 

In order to apply the Banach fixed point theorem, we first need to identify 
an appropriate space on which T(-,x) operates as a contraction. For that 
purpose, we consider, for 0 < A < 7 , e > 0, the space 


M x (e) 


y(t) : ||y||exp,A := sup e A 4 ||j/(f)|| < 
t> 0 


(6.3.19) 


M\(e) is a complete normed space. We fix A, e.g. A = in the sequel. 
Because of (6.3.11), (6.3.12), we have for y G M\(e) 


\\T(y,x)(t)\\ < c 0 e 7 *||a;| 


< c 0 e 7 *||a;| 


+ c 0 S(s)^J e 7(4 s) ||y(s)|| ds 

OO \ 

+ J e^-')||y( S )||d a J 

+ co<5(e) ( sup e As ||j/(s)|| f e _7 ^ _s - ) e _As ds 
\0<s<t J 

v 0 

OO \ 

+ sup e As || 2 /(s)|| /e 7(t - s) e- As ds J. 
t<S< OO J J 

t 7 

(6.3.20) 


Now since 

t 


J e- 7 (t " s) e- As ds = e" 74 -!-^ (e (7 " A)4 - l) < 


— A t 


7 — A 


OO 

[ e llt-s) e -\s ds = e 7 *_L^ e -( 7 +A)t = -At 

J 7 + A 7 +A 
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(6.3.20) implies 

\\T(y,x)(t)\\ < coe-^WxW + 2C ° d( fe- At || 2/ || exp , A . (6.3.21) 

Similarly, for yi,y 2 € M\{e) 

|| T(yi,x)(t)-T(y 2 ,x)(t)\\ < ^M e - Xt \\ yi ~ y 2 || exp , A . (6.3.22) 


Because of our assumptions on (5(e) (see (6.3.11), we may choose e so small 
that 


4co xf , , 1 

7 ^A S(£)S 2' 

(6.3.23) 

Then from (6.3.22), for yi,y 2 G M\(a :) 


\\T(V 1 ,X) - T(y 2 ,x) ||exp,A < ^\\Vl - Z/2 1| exp, A • 

(6.3.24) 

If we assume in addition that 


INI < P, 

2 c 0 

(6.3.25) 

then for y G by (6.3.21) 


||r(2/,x)||exp,A < £• 

(6.3.26) 


Thus, if e satisfies (6.3.23), and ||x|| < then T(-,x) maps M\(e) into 
itself, with a contraction constant Therefore applying the Banach fixed 
point theorem, we get a unique solution y x G M\{e) of (6.3.18), for any 
x € P+H with ||x|| < 

Obviously, T(0, 0) = 0, and thus y 0 = 0. Also, since y x G M A (e) is decay- 
ing exponentially, we have for any x (with ||x|| < ^) 

lim y x (t) = 0, 

t—*oo 

i.e. 

yM G IT s (0). 

From (6.3.17), we have 

t OO 

y x (t) = e~ Lt x- J e~ L{ - t ~ s) P + r](y x {s))ds- j e~ L ^~ s) P-rj(y x (s)) ds. 
o t 

y x lies in M(e) and so in particular is bounded for t > 0. Thus, it also satisfies 
(6.3.16), i.e. 
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t OO 

Ux(t) = e~ Lt P+y x (0) + J e~ L{ - t ~ s) P + y{y x {s))ds- j e~ L ^~ s) P-y{y x {s)) ds, 

o t 

and comparing these two representations, we see that 

x = P+y x ( 0). (6.3.27) 

Thus, for any U C {||x|| < 9 §^}, we have a map 

H+ n U -* W s (0) 

X J/a; (0) , 

with inverse given by P+, according to (6.3.27). We claim that this map is 
a bijection between H + fl U and its image in IT s (0). For that purpose, we 
observe that as in (6.3.21), we get assuming (6.3.25), 

WVxi {t) Vx 2 W II — c 0® ^ \\Xl X '2 1| + — 112/cci Ux 2 1 1 exp, A) 

hence 


WVxi (0) - iU0)|| < I \y Xl - y X 2 llexp.A < 2c 0 ||xi - X 2 ||. (6.3.28) 

We insert the second inequality in (6.328) into the integrals in (6.3.17) and 
use (6.3.12) as before to get from (6.3.17) 

lllUO) ^ l/x a (0)|| > Ikr - X 2 1 - - z 2 ||. 

7 — A 

If in addition to the above requirement ^Co5(e) < j we also impose the 
condition upon e that 

4c^(g) 1 

7- A “ 2’ 

the above inequality yields 

llfe(0) - y X 2 (0)11 > ^11*1 - *2 II- (6.3.29) 

Thus, the above map indeed is a bijection between {x € P+H, ||x|| < 
and its image W in IT s (0). (6.3.28) also shows that our map x 1 — > y x (0) is 
Lipschitz, whereas its inverse is Lipschitz by (6.3.29). 

In particular, since yo = 0 as used above, W contains an open neighbor- 
hood of 0 in ir s (0), hence is of the form IT s (0, U) for some open U. 

We now verify that IT s (0, U) is tangent to P+H at 0. (6.3.11), (6.3.17) 
and (6.3.28) (for X\ = x,X2 = 0, recalling y 0 = 0) 



312 6. Morse Theory and Floer Homology 


||P_y x ( 0 )|| = || J e Ls P-ii(y x (s)) ds\\ 

0 

oo 

< c 0 J e _7S <KIMs)l|)fe(s)|| ds 

0 


oo 

< c 0 Je^S (2c 0 e _As ||ar||) 2c 0 e _As ||a;|| 


< 


o 

2 c? 


o 


7 - A 


5 ( 20011 * 11 ) 11 * 11 . 


This implies 

11^(0)11 

11^(0)11 


11 ^( 0)11 

11*11 


0 as y x ( 0) -> 0 in W s (0,U), 
or equivalently x — > 0 in P+H. 


This shows that W s ( 0, U) indeed is tangent to P+H at 0. 

The regularity of W s (0,U) follows since T(y,x ) in (6.3.17) depends 
smoothly on 77 . (It is easily seen from the proof of the Banach fixed point 
theorem that the fact that the contraction factor is < 1 translates smooth- 
ness of T as a function of a parameter into the same type of smoothness of 
the fixed point as a function of that parameter.) 

Obviously, the situation for W u ( 0, U) is symmetric to the one for IT s (0, U). 

□ 


The preceding theorem provides the first step in the local analysis for the 
gradient flow in the vicinity of a critical point of the function /. It directly 
implies a global result. 


Corollary 6.3.1 The stable and unstable manifolds W s (x),W u (x) of the 
negative gradient flow </> for a smooth function f are injectively immersed 
smooth manifolds. (If f is of class C k+2 , then W s (x) and W u (x) are of class 
C k .) 


Proof. We have 

W S (X)= \Jf(;t)(W S (x,U)) 

t< 0 

W*(x) = (J </>(', t)(W u (x,U)) 

t> 0 

for any neighborhood U of x. □ 


Of course, the corollary holds more generally for the flows of the type 
(6.3.10) (if we consider only those flow lines (j>(-,t) that exist for all t < 0 
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resp. t > 0). (The stable and unstable sets then are as smooth as rj is.) 
The point is that the flow (/>(•, t), for any t and any open set U, provides a 
diffeomorphism between U and 4>{U,t ), and the sets 4>(U,t) cover the image 
of </)(-,■). 

The stable and unstable manifolds IT s (0), W“(0) for the flow (6.3.10) are 
invariant under the flow, i.e. if e.g. 

x = <f>(x,0) G W u { 0), 

then also 


x(t) = (f>{x, t) G W w ( 0) for allt G R. for which it exists. 

In 6.4, we shall easily see that because / is decreasing along flow lines, the 
stable and unstable manifolds are in fact embedded, see Corollary 6.4.1. 

We return to the local situation. The next result says that more generally, 
in some neighborhood of our nondegenerate critical point 0, we may find a so- 
called stable foliation with leaves A s (z u ) parametrized by z u G W u ( 0), such 
that where defined, A s (0) coincides with W s (0) while all leaves are graphs 
over W s (0), and if a flow line starts on the leaf A s (z u ) at t = 0, then at 
other times t, we find it on A s ((f>(z u , t)), the leaf over the flow line on W u ( 0) 
starting at z u at t, = 0. Also, as t increases, different flow lines starting on 
the same leaf approach each other at exponential speed. 

The precise result is 

Theorem 6.3.2 Suppose that the assumptions of Theorem 6.3.1 hold. There 
exist constants ci,A > 0, and neighborhoods U of0inH,V of 0 in P+H 
with the following properties: 

For each z u G W u (f) 1 U), there is a function 

<p Zu :V^ H. 

c p Zu (z + ) is as smooth in z u ,z+ as p is, for example of class C k if rj belongs 
to that class. If 

z G A s (z u ) = <p Zu (V), 

then 

<t>(z,t) = ip^ Zu ^(P + (/)(z,t)) (6.3.30) 

and 

\\4>{z, t) - 4>{z u ,t) || < Cie _A * (6.3.31) 

as long as <j)(z,t), <j>(z u ,t) remain in U. 

We thus have a smooth (of class C k , if rj G C k ), so-called stable foliation 
which is flow invariant in the sense that the flow maps leaves to leaves. In 
particular, yl s (0) is the stable manifold W s (0)<lV , <f>(z,t) approaches W /s (0)fl 
V exponentially for negative t, as long as it stays in U . 

Of course, there also exists an unstable foliation with analogous properties. 
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Fig. 6.3.4. 

Corollary 6.3.2 Let f : X — > 1 be of class C k+2 , k > 1, x a nondegenerate 
critical point of f. Then in some neighborhood U of x, there exist two flow- 
invariant foliations of class C k , the stable and the unstable one. The leaves 
of these two foliations intersect transversally in single points , and conversely 
each point of U is the intersection of precisely one stable and one unstable 
leaf. 

The Corollary is a direct consequence of the Theorem, and we thus turn 
to the proof of Thm. 6.3.2: 

Changing rj outside a neighborhood U of 0 will not affect the local struc- 
ture of the flow lines in that neighborhood. By choosing U sufficiently small 
and recalling (6.3.11), we may thus assume that the Lipschitz constant of 77 
is as small as we like. We apply (6.3.13) to ) and <j)(z u ,t ) and get for 
r > 0 , putting y(t\ z, z u ) := </>(z, t) - (j>(z u , t), 

y(t ; z u ) = e _i(t_r) 2 /(r; z, z u ) 

t 

+ j e~ L(t ~ s \ifl(l)(z, s)) -r]((j)(z u ,s)))ds. (6.3.32) 

r 

If this is bounded for t — > 00 , then (6.3.12) implies, as in (6.3.15), 

lim e~ L ^^ T ^P-y{t\z,z u ) = 0. (6.3.33) 

T — >00 

Consequently, as in (6.3.16) we get, 

y(t ; 2 , z u ) = e~ Lt P + y(0-, z, z u ) (6.3.34) 

+ / e~ L{t ~ s) P+(r]((j)(z u ,s) + y(s-,z, z u )) - r)(<j)(z u , s))) ds 

J 0 
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— J e L(t ^P-(r]((j>(z u ,s)+y(s;z,z u ))-ri(<j>(z u ,s)))ds. 

t 

As in the proof of Thm. 6.3.1, we want to solve this equation by an application 
of the Banach fixed point theorem, i.e. by finding a fixed point of the iteration 
of 


T(y,z u ,z+) := e Lt z + (6.3.35) 

t 

+ J e.~ L{t ~ s) P+(ji((t>(z u ,s) + y(s)) -r)((j>(z u ,s)))ds 

0 

oo 

- J e~ L{t ~ s) P-{y{(j)(z u , s) + y(s)) - r}{<f>{z u , s))) ds, 

t 

for 2 + £ P + H. As in the proof of Thm. 6.3.1, we shall use a space M\(e 0 ) 
for some fixed 0 < A < 7 . 

Before we proceed to verify the assumptions required for the application 
of the fixed point theorem, we wish to describe the meaning of the construc- 
tion. Namely, given z u £ W u ( 0), and the orbit <j>(z u ,t) starting at z u and 
contained in W u ( 0), and given z + £ P+H, we wish to find an orbit <j>(z,t) 
with P + (f>(z,0) = P+z = z+ that exponentially approaches the orbit <f>(z u ,t) 
for t > 0. The fixed point argument will then show that in the vicinity of 0, 
we may find a unique such orbit. If we keep z u fixed and let z+ vary in some 
neighborhood of 0 in P+H , we get a corresponding family of orbits <j)(z,t), 
and the points z = <j>{z, 0 ) then constitute the leaf through z u of our folia- 
tion. The leaves are disjoint because orbits on the unstable manifold W u ( 0) 
with different starting points for t = 0 diverge exponentially for positive t. 
Thus, any orbit <t>{z,t) can approach at most one orbit <j>(z u ,t) on W" (0) 
exponentially. In order to verify the foliation property, however, we also will 
have to show that the leaves cover some neighborhood of 0 , i.e. that any flow 
line <f>(z, t) starting in that neighborhood for t, = 0 approaches some flow line 
< j>(z u ,t ) in W“(0) exponentially. This is equivalent to showing that the leaf 
through z u depends continuously on z u , and this in turn follows from the 
continuous dependence of the fixed point of T(-, z u , z+) on z u . 

Precisely as in the proof of Thm. 6.3.1, we get for 0 < A < 7 (say A = ^), 
with c 0 , 7 as in (6.3.12), ||2+|| < £ 1 , y £ M x (s 0 ), i.e. \\y(t)\\ < e~ xt £ 0 , and 
with [y]u.p being the Lipschitz constant of 7 

\\T(y, z u , z+)(t)\\ < c 0 £ie -7t + 2c ^-['n\Li P e~ xt (6.3.36) 

7 - A 

and 

\\T{yi,z u ,z+)(t) - T(y 2 , z u , z+)(t)\\ < 4 c ° ^ Lip e~ xt \\y! -y 2 ||ex P ,A- (6.3.37) 

7 — A 
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As remarked at the beginning of this proof, we may assume that [t]\ j jlp is 
as small as we like. Therefore, by choosing eq > 0 sufficiently small, we 
may assume from (6.3.36) that T(-, z u , z+) maps M\[e o) into itself, and from 

(6.3.37) that it satisfies 

\\T(yi,z u ,z+) - T(y 2 , z u , z+)\\ exp, A 5: 2 H ^ 1 2/2 1 1 exp, A* 

Thus, the Banach fixed point theorem, applied to T(-, z u , z+) on the space 
M x (e 0 ), yields a unique fixed point y Zu , z+ on this space. We now put 

‘Pzu(zi) == Uz u ,z + 

z=y Zu , z+ (0). (6.3.38) 

We then have all the required relations: 

P+z = P+y ZutZl (0) = z\ from (6.3.35), 

and hence y Zu , z+ solves (6.3.34), i.e. is of the form y(t;z,z u ) with z from 

(6.3.38) , and <j>{z,t) = y(t;z,z u ) + <j>(z u ,t) is a flow line. Condition (6.3.30) 
thus holds at t = 0. Since the construction is equivariant w.r.t. time shifts, 
because of the group property 

<j>(z,t + t) = <f>((f>(z , t) , t) for all i, r, 

(6.3.30) holds for any t, as long as <j>{z,t) stays in our neighborhood U of 0. 
The exponential decay of <j>(z, t) — (f>(z u , t) = y(t\ z, z u ) follows since we have 
constructed our fixed point of T in the space of mappings with precisely that 
decay. 

Since T is linear in z+, we see as before in the proof of Thm. 6.3.1 that 
a smoothness property of ry translates into a smoothness property of y Zu as 
a function of z+. It remains to show the smoothness of y ZutZ+ as a function 
of z u . This, however is a direct consequence of the fact that y Zu , z+ is a fixed 
point of T(-, z u , z+), an operator with a contraction constant < 1 on the 
space under consideration (M\(e o)), and so the smooth dependence of T (see 
(6.3.35)) on the parameters z u and z + (which easily follows from estimates 
of the type used above) translates into the corresponding smoothness of the 
fixed point as a function of the parameters z u ,z+. 

The foliation property is then clear, because leaves corresponding to 
different z' u ,z” £ W u (0,(7) cannot intersect as we had otherwise z = 
Uz' u ,z + (0) = y~" lZ+ ( 0) for some z with z + = P+z, hence also z' u = <j>(z' u ,Q) = 
<P(z, 0) - y z ' u ,z+{ 0) = 4>{z, 0) - y z n, z +{ 0) = 

As the leaves depend smoothly on z u , they approach the stable manifold 
W s (0) at the same speed as z u does. More precisely, any orbit <j>(z u ,t ) con- 
verges to 0 exponentially for t — > — oo, and the leaf over <j>(z u ,t) then has to 
converge exponentially to the one over 0 which is W s (0). 

The last statement easily follows by changing signs appropriately, for ex- 
ample by replacing t by — t throughout. □ 
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Perspectives. The theory of stable and unstable manifolds for a dynamical sys- 
tem is classical. Our presentation is based on the one in [49], although we have 
streamlined it somewhat by consistently working with function spaces with expo- 
nential weights. 


6.4 Limits of Trajectories of the Gradient Flow 


As always in the chapter, A is a complete Riemannian manifold, with metric 
(•,•), associated norm || • ||, and distance function f : X — > R is a 

C 2 -function. We consider the negative gradient flow 

x(t) = - grad for f € R 

x(0) = x for x € X. 

We assume that the norms of the first and second derivative of / are bounded. 
Applying the Picard-Lindelof theorem (see § 1.6), we then infer that our flow 
is indeed defined for all t £ M. Also, differentiating (6.4.1), we get 

x{t)(= V d_x{t)) = —(V_a_ grad f(x(t)))x(t) 

= (V a grad f(x(t))) grad f(x(t)). 

ox 

In particular, the first and second derivative of any flow line is uniformly 
bounded. For later use, we quote this fact as: 

Lemma 6.4.1 There exists a constant Co with the property that for any so- 
lution x(t) of (6.4.1), 

l|*||c 1 (E,TX) < Co- 
in particular, x(t) is uniformly Lipschitz continuous. 

(6.4.1) is a system of so-called autonomous ordinary differential equations, 
meaning that the right hand side does not depend explicitly on the “time” t, 
but only implicitly through the solution x(t). 

In contrast to the previous § where we considered the local behaviour of 
this flow near a critical point of /, we shall now analyze the global properties, 
and the gradient flow structure will now become more important. 

In the sequel, x(t) will always denote a solution of (6.4.1), and we shall 
exploit (6.4.1) in the sequel without quoting it explicitly. We shall call each 
curve x(t), t £ M, a flow line, or an orbit (of the negative gradient flow). We 
also put, for simplicity 

x(±oo) := lim x(t), 


assuming that these limits exist. 
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Lemma 6.4.2 The flow lines of (6.4.1) are orthogonal to the level hypersur- 
faces f = const. 

Proof. This means the following: If for some t G R, V £ T x ^X is tangent to 
the level hypersurface {y : f{y) = f(x(t))}, then 

<V,±(t)>= 0. 


Now 

(V,x(t)) = -( V . , grad f(x(t))) 

= — V(f)(x(t)) by the definition of grad/, see (2.1.14) 

= 0 since V is tangent to a hypersurface on which 

f is constant. 

□ 

We compute 

j t f{- x (t)) = df(x(t))x(t) = ( grad f(x(t)),x(t)) by (2.1.14) 

= -||i(i)H 2 - (6-4.2) 

As a consequence, we observe 

Lemma 6.4.3 f is decreasing along flow lines. In particular, there are no 
nonconstant homoclinic orbits, i.e. nonconstant orbits with 

x{— oo) = a:(oo). 


□ 

Thus, we see that there are only two types of flow lines or orbits, the 
“typical” ones diffeomorphic to the real axis (— 00 , 00 ) on which / is strictly 
decreasing, and the “exceptional” ones, namely those that are reduced to 
single points, the critical points of /. The issue now is to understand the 
relationship between the two types. 

Another consequence of (6.4.2) is that for t \ , f -2 G M 

*2 *2 

f{x(h)) - f{x{t 2 )) = - J J t f{x{t))dt = J ||a;(t )|| 2 

t\ ti 

t2 

= J II grad f{x{t))\\ 2 dt. (6.4.3) 

tl 


We also have the estimate 
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||±(<)|| 2 (iiJ 

by Holder’s inequality (6.4.4) 

by (6.4.3). 

Lemma 6.4.4 For any flow line, we have fort — » ±oo that grad f(x{t )) — > 0 
or \f( x (t))\ oo. 


t 2 /t 2 

d{x(t 1 ),x(t 2 )) < J \\x(t)\\ dt < (t 2 - h)^ ij 

ti \ti 


Proof. If e.g. /oo = liirij^oo f(x(t)) > — oo, then for 0 < t < oo 


fo ■= f{x( 0)) > f(x{t)) > /oo, 


and (6.4.3) implies 



fo /oo ^ OO. 


(6.4.5) 


Since x(t) = — grad f(x(t)) is uniformly Lipschitz continuous by Lemma 6.4.1, 
(6.4.5) implies that 


lim grad f(x(t)) = lim x(t) = 0. 

t — >-oo t—> OO 


□ 


We also obtain the following strengthening of Corollary 6.3.1: 

Corollary 6.4.1 The stable and unstable manifolds W s (x), W u (x) of the 
negative gradient flow <j> for a smooth function f are embedded manifolds. 

Proof. The proof is an easy consequence of what we have already derived, 
but it may be instructive to see how all those facts are coming together here. 

We have already seen in Corollary 6.3.1 that W s (x ) andW“(a:) are in- 
jectively immersed. By Corollary 1.6.1, each point in X is contained in a 
unique flow line, but the typical ones of the form (— 00 , 00 ) are not compact, 
and so, their closures may contain other points. By Lemma 6.4.4, any such 
point is a critical point of /. The local situation near such a critical point 
has already been analyzed in Theorem 6.3.1. The only thing that still needs 
to be excluded to go from Corollary 6.3.1 to the present statement is that 
a flow line x{t ) emanating at one critical point x{— 00 ) returns to that same 
point for t — > 00 . This, however, is exluded by Lemma 6.4.3. □ 

In the sequel, we shall also make use of 
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Lemma 6.4.5 Suppose (x n ) n& ^ C X converges to Xq. Then for any T > 

0 , the curves x n (t)\[_x,T] (with a: n (0) = x n ) converge in C 1 to the curve 

x o(t)\[-T,T\ ■ 

Proof. This follows from the continuous dependence of solutions of ODEs on 
the initial data under the assumption of the Picard-Lindelof theorem (the 
proof of that theorem is based on the Banach fixed point theorem, and the 
fixed point produced in that theorem depends continuously on a parameter, 
cf. J.Jost, Postmodern Analysis, Springer, 1998, p.129). Thus the curves x n (t) 
converge uniformly to xq ( t) on any finite interval [— T, T]. By Lemma 6.4.1, 
x n (t) are uniformly bounded, and so x n has to converge in C 1 . □ 

We now assume for the remainder of this § that f satisfies the Palais- 
Smale condition (PS), and that all critical points of f are nondegenerate. 

These assumptions are rather strong as they imply 

Lemma 6.4.6 / has only finitely many critical points in any bounded region 
of X, or, more generally in any region where f is bounded. In particular, in 
every bounded interval in R there are only finitely many critical values of f, 

1. e. 7 £ R for which there exists p £ X with df(p) = 0, f(p) = 7. 

Proof. Let (p n )neN C X be a sequence of critical points of /, i.e. df(p n ) = 
0. If they are contained in a bounded region of X, or, more generally, if 
f(p n ) is bounded, the Palais-Smale condition implies that after selection of a 
subsequence, they converge towards some critical point pq. By Thm. 6.3.1, we 
may find some neighborhood U of po in which the flow has the local normal 
form as described there and which in particular contains no other critical 
point of f besides po. This implies that almost all p n have to coincide with 
Po , and thus there can only be finitely many of them. □ 

Our assumptions - (PS) and nondegeneracy of all critical points - also 
yield 

Lemma 6.4.7 Let x(t) be a flow line for which f(x(t)) is bounded. Then 
the limits :r(±oo) := lim^-too x(t) exist and are critical points of f. x(t) 
converges to x(±oo) exponentially as t — ► ±00. 

Proof. By Lemma 6.4.4, grad f(x[t)) — > 0 for t — > ±00. Analyzing w.l.o.g. 
the situation t — > —00, (PS) implies that we can find a sequence (t n ) ne n C R, 
t n — > —00 for n — » 00, for which x(t n ) converges to some critical point £_oo 
of /. We wish to show that lim t _,_ 00 x(t) exists, and it then has to coincide 
with X-oq. 

This, however, directly follows from the nondegeneracy condition, since 
by Thm. 6.3.1 we may find a neighborhood U of the critical point X-oo 
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with the property that any flow line in that neighborhood containing x_x 
as an accumulation point of some sequence x(t n ), t n — > — oo, is contained 
in the unstable manifold of X-x- Furthermore, as shown in Thm. 6.3.1, the 
convergence is exponential. □ 

Remark. Without assuming that the critical point x(—oo) is nondegenerate, 
we still may use (PS) (see Lemma 6.4.8 below) and grad f(x(t)) — > 0 for 
t — > —oo to see that there exists to £ R for which U := {x(t) : t < to} is 
precompact and in particular bounded. By Taylor expansion, we have in U 

|| grad f(x) || < || grad f(x-x)\\ + cd(x, X-x) = cd(x, X-x) 
for some constant c, as grad f(x-x) = 0. 

Thus, for t < t n 

t t 

d(x(t),x-x) < J \\x(s)\\ ds < c j d(x(s),x_x) ds. 

— OO — OO 

The latter integral may be infinite. As soon as it is finite, however, we already 
get 

d(x(t),X-x) < Cie ct for some constant ci, 
i.e. exponential convergence of x(t) towards X-x as t — > — oo. 

We shall also use the following simple estimate 

Lemma 6.4.8 Suppose || grad /(a:(f))|| > e, for t\ <t<t 2 - Then 

d(x(t i),x(t 2 )) < ~(f{x(t i)) - f(x(t 2 )). 

£ 

Proof. 

t2 

d(x(t i),x(t 2 ))< J \\x(t)\\dt 

tl 

j II^WII 2 ^ since II^WII = II s rad /( a: ( i ))ll > £ 

= - f(x{t 2 )) by (6.4.3). 

£ 

□ 

We now need an additional assumption: 
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There exists a flow-invariant compact set X? C X containing the critical 
points p and q. 

What we have in mind here is a certain set of critical points together with 
all connecting trajectories between them. We shall see in Thm. 6.4.1 below 
that we need to include here all critical points that can arise as limits of flow 
lines between any two critical points of the set we wish to consider. 

Lemma 6.4.9 

Let (x„(t)) n gN be a sequence of flow lines in X? with 
x n (-oo) = p, x n (oo) = q. 

Then after selection of a subsequence, x n {t) converges in C 1 on any com- 
pact interval in R towards some flow line Xo (t). 

Proof. Let to € R. If (for some subsequence) 

|| grad f{x n {t 0 )) || -> 0, 

then by (PS) (71 = f(jp), 72 = f(q), noting f(p) > f(x(t)) > f(q) by 
Lemma 6.4.3), we may assume that x n {to) converges, and the convergence of 
the flow lines on compact intervals then follows from Lemma 6.4.5. We thus 
assume 

|| grad f(x n (to))\\ > £ for all n and some e > 0. 

Since f(x n (t)) is bounded between f(p) and f{q), Lemma 6.4.4 implies that 
we may find t n < to with 

|| grad f(x n (t n )) || = £ 
and || grad f{x n (t))\\ >e for t n < t < t 0 . 

From (6.4.3), we get \t n -t 0 \ < J*(/(t„)-/(to)) < |r(/(p) -/(<?))• Applying 
our compactness assumption on Xfl we may assume that x n (t n ) converges. 
From Lemma 6.4.5 we then see that x n (t) converges on any compact interval 
towards some flow line Xq (t). □ 

In general, x n (t) will not converge uniformly on all of R towards Xq (t). 
We need an additional assumption as in the next 

Lemma 6.4.10 Under the assumption of Lemma 6.f.9, assume 

Zo(-oo) =p, x 0 (oo) = q, 

i.e. Xq ( t) has the same limit points as the x n {t). Then the x n (t) converge to 
Xq ( t) in the Sobolev space H 1 ' 2 ( R,A). In fact, this holds already if we only 
assume f(x 0 (-oo )) = f(p),f(x 0 { 00)) = f(q). 


Proof. The essential point is to show that 
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lim x n (t) = p, lim x n (t) = q, uniformly in n. 

t — >—oo t—> oo 

Namely in that case, we may apply the local analysis provided by Thm. 6.3.1 
uniformly in n to conclude convergence for t < t\ and t > <2 for certain 
t\ . t ‘2 £ R, and on the compact interval [Li, tf, we get convergence by the 
preceding lemma. 

Because of (PS), we only have to exclude that after selection of a subse- 
quence of x n (t), we find a sequence (t n ) C R converging to oo or — oo, 
say — oo, with 


|| grad f(x n (t n )) || > e for some £ > 0. (6.4.6) 

From (6.4.4), we get the uniform estimate 

|| grad f(x n (ti)) — grad /(a; n (t 2 ))|| < c (<2 — ti)^ for some constant c. 

(6.4.7) 

By (6.4.6), (6.4.7), we may find <5 > 0 such that for t n — S < t < t n 

|| grad f(x n (t )) || > |, 

hence 

f(p) - f(x n (t„)) > f(x n (t n - S)) - f(x n (t n )) > by (6.4.3). 

On the other hand, by our assumption on Xg (t), we may find f 0 €l with 

f(p) - f(xo(to)) = 6^-- (6.4.8) 

If t n < to, we have 

f(p) - f(x„(t 0 )) > f(p ) - f(x n (t n )) > S^, 

and so x n (t o) cannot converge to xo(to), contrary to our assumption. Thus 
(6.4.6) is impossible, and the proof is complete, except for the last remark, 
which, however, also directly follows as the only assumption about Xq (t) that 
we need is (6.4.8). □ 

We are now ready to demonstrate the following compactness 

Theorem 6.4.1 Let p,q be critical points of f, and let M f p q C X? be a 
space of flow lines x(t)(t £ R) for f with x(—oo) = p, x(oo) = q. Here 
we assume that X? is a flow-invariant compact set. Then for any sequence 
(x n (t)) n gN C Xdp qi after selection of a subsequence, there exist critical points 


P = Pi,P2,---,Pk = q , 
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flow lines yi € M- Pi Pi+1 and t n t i el (i = 1 , — , k — 1 , n € N) such that the 
flow lines x n (t + t n ; ,) converge to yi for n — > oo. In this situation, we say 
that the sequence x n (t) converges to the broken trajectory y\ffyiff ■ ■ ■ ffyk- 1 - 

Proof. By Lemma 6.4.9, x n (t) converges (after selection of a subsequence, as 
always) towards some flow line #o(t). x o(t) need not be in M- pq , but the limit 
points xo(— oo), a;o(oo) (which exist by Lemma 6.4.7) must satisfy 

f(P ) > /(xo(-oo)) > f(x Q ( oo)) > f(q). 

If e.g. f(p) = f(x o(— oo)) then the proof of Lemma 6.4.10 shows that 
Zo(-°o) = P- 

If f(p) > f(x 0 (— oo)), we choose f(x 0 (— oo)) < a < f(p) and t n>i with 

f{x n {t n ,i )) = a. 

We apply Lemma 6.4.9 to x n (t + t n> i) to get a limiting flow line yo(t). Clearly 
f(p) > f(yo(—oo)), and we must also have 

f(yo(oo)) > f(x o(-oo)), 

because otherwise the flow line yo(t) would contain the critical point xq(— oo) 
in its interior. 

If f(p) > f(yo(~oo)) of f{yo(oo)) > f(x 0 (- oo)), we repeat the process. 
The process must stop after a finite number of such steps, because the critical 
points of / are isolated because of (PS) and the nondegeneracy assumption 
yielding to the local picture of Thm. 6.3.1 (see Lemma 6.4.6). □ 


6.5 The Morse-Smale-Floer Condition: TVansversality 
and Z 2 -Cohomology 

In this §, we shall continue to assume the Palais-Smale condition and the 
nondegeneracy of all critical points of our function / : X — > R. Here, we 
assume that / is of class C 3 . 

The central object of Morse-Floer theory is the space of connecting tra- 
jectories between the critical points of a function /. If f is bounded, then by 
Lemma 6.4.6, any x € X lies on some such trajectory connecting two critical 
points of /. In the general case, one may simply restrict the considerations 
in the sequel to the subspace X? of X of such connecting trajectories, and 
one may even consider only some subset of the critical points of / and the 
connecting trajectories between them, including those limiting configurations 
that arise by Thm. 6.4.1. As in § 6.4, we need to assume that the set of flow- 
lines under consideration is contained in a compact flow-invariant set. Thus, 
we shall assume X is such a closed space of connecting trajectories. 
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X then carries two stratifications S s and S u , consisting of the stable 
resp. unstable manifolds of the critical points of f. Thus, each point lies on 
precisely one stratum of S s , and likewise on one stratum of S u , and each 
such stratum is a smooth manifold, by Cor. 6.3.1. 

Definition 6.5.1 The pair ( X,f ) satisfies the Morse-Smale-Floer condition 
if all intersections between the strata of S 3 and the ones of S u are finite- 
dimensional and transversal. 

We recall that two submanifolds X\ , X 2 of X intersect transversally if for 
all x £ Xi fl X 2 , the tangent space T x X is the linear span of the tangent 
spaces T x X\ and T x X 2 . If the dimension of X is finite, then if X\ and X 2 
intersect transversally at x, we have 

dim Xi + dimX 2 = dim(Xi D X 2 ) + dim X. (6.5.1) 

It easily follows from the implicit function theorem that in the case of a 
transversal intersection of smooth manifolds Xi,X 2 , X 1 fl X 2 likewise is a 
smooth manifold. 

In addition to (PS) and the nondegeneracy of all critical points of f, we 
shall assume for the rest of this § that (X, /) satisfies the Morse-Smale- 
Floer condition. 

Definition 6.5.2 Let p, q be critical points of /. If the unstable manifold 
W u (p) and the stable manifold W s (q) intersect, we say that p is connected 
to q by the flow, and we define the relative index of p and q as 

p(p,q) := dim (W u (p) HWflq)). 

p(p, q) is finite because of the Morse-Smale-Floer condition. 

If X is finite dimensional, then the Morse indices p(p) of all critical points 
p of / themselves are finite, and in the situation of Def. 6.5.2, we then have 

p{p, q) = p{p) - p{q) (6.5.2) 

as one easily deduces from (6.5.1). Returning to the general situation, we 
start with the following simple observation 

Lemma 6.5.1 Any nonempty intersection W u (p) fl W s (q) (p, q £ C(f),p ^ 
q) is a union of flow lines. In particular, its dimension is at least one. 

Proof. If x £ W u (p), then so is the whole flow line x(t) (x(0) = x), and the 
same holds for x £ W s (q). □ 

p is thus connected to q by the flow if and only if there is a flow line x(t) 
with x(— 00 ) = p and x(oo) = q. Expressed in another way, the intersections 
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W u (p) IT W s (q) are flow invariant. In particular, in the case of a nonempty 
such intersection, p and q are both contained in the closure of W u (p)PW s {q). 

The following lemma is fundamental: 

Lemma 6.5.2 Suppose that p is connected to r and r to q by the flow. Then 
p is also connected to q by the flow, and 

p(p,q) = p{p,r) +p(r,q). 


Proof. By assumption, W u (p) intersects W s (r) transversally in a manifold 
of dimension p(p, r). Since W s (r) is a leaf of the smooth stable foliation of r 
in some neighborhood U of r by Thm. 6.3.2, in some possibly smaller neigh- 
borhood of r, W u (p) intersects each leaf of this stable foliation transversally 
in some manifold of dimension p(p,r). Similarly, in the vicinity of r, W s (q) 
also intersects each leaf of the unstable foliation of r in some manifold, this 
time of dimension /z(r, q). Thus, the following considerations will hold in some 
suitable neighborhood of r. 

The space of leaves of the stable foliation of r is parametrized by W u (r), 
and we thus get a family of p(p, r)-dimensional manifolds parametrized by 
W u (r). Likewise, we get a second family of /r(r, (^-dimensional manifolds 
parametrized by W s (r). The leaves of the stable and unstable foliations sat- 
isfy uniform C-estimates (in the vicinity of r) by Thm. 6.3.2, because of our 
assumption that / is of class C 3 . The two finite-dimensional families that we 
have constructed may also be assumed to satisfy such uniform estimates. The 
stable and unstable foliations yield a local product structure in the sense that 
each point near r is the intersection of precisely one stable and one unstable 
leaf. 

If we now have two such foliations with finite-dimensional smooth sub- 
families of dimension m and ri 2 , say, all satisfying uniform estimates, it then 
easily follows by induction on rii and 712 that the leaves of these two subfami- 
lies need to intersect in a submanifold of dimension m + ?i 2 - The case where 
rii = 712 = 0 can be derived from the implicit function theorem. □ 

We also have the following converse result 

Lemma 6.5.3 In the situation of Thm. 6.f.l, we have 

k - 1 

^2p(Pi,Pi+ 1 ) = p(p, q)- 

i= 1 

Proof. It suffices to treat the case k = 3 as the general case then will easily 
follow by induction. This case, however, easily follows from Lemma 6.5.2 with 
P = Pi,r =p 2 , q = Pz- □ 
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We shall now need to make the assumption that the space X? of con- 
necting trajectories that we are considering is compact. (At this moment, we 
are considering the space W u (p) fl W s (q).) 

Lemma 6.5.4 Suppose that p,q (p q) are critical points of f, connected 
by the flow, with 

p(p, q) = 1. 

Then there exist only finitely many trajectories from p to q. 

Proof. For any point x on such a trajectory, we have 

f(p) > f(x) > /(g)- 

We may assume that £ > 0 is so small that on each flow line from p to q , 
we find some x with || grad /(a;)|| = e, because otherwise we would have 
a sequence of flow lines (sj)j e j,j from p to q with sup,j, 6s . || grad f(x) || — > 0 
for i —> oo. By (PS) a subsequence would converge to a flow line s (see 
Lemma 6.4.5) with grad f(x) = 0 on s. s would thus be constant, in con- 
tradiction to Thm. 6.4.1. Thus, if, contrary to our assumption, we have 
a sequence (si)i S N of trajectories from p to q , we select Xi € Sj with 
|| grad f(xi)\\ = e, use the compactness assumption on the flow-invariant 
set containing the Sj to get a convergent subsequence of the x^, hence also of 
the Si by Thm. 6.4.1. The limit trajectory s also has to connect p to q , because 
our assumption p(p,q) = 1 and Lemmas 6.5.1 and 6.5.3 rule out that s is a 
broken trajectory containing further critical points of /. The Morse-Smale- 
Floer condition implies that s is isolated in the one-dimensional manifold 
W u (p) fl W s (q). This is not compatible with the assumption that there ex- 
ists a sequence (si) of different flow lines converging to s. Thus, we conclude 
finiteness. □ 

We can now summarize our results about trajectories: 

Theorem 6.5.1 Suppose our general assumptions (f G C 3 , (PS), nondegen- 
eracy of critical points, Morse-Smale- Floer condition) continue to hold. Let 
p, q be critical points of f connected by the flow with 

h(p, q) = 2 . 

Then each component of the space of flow lines from p to q, A4^ q := W u (p)C] 
W s ( q ) either is compact after including p,q ( and diffeomorphic to the 2- 
sphere), or its boundary (in the sense of Thm. S.f.l) consists of two different 
broken trajectories from p to q. 

Conversely each broken trajectory s = s\ffs 2 from p to q (this means that 
there exists a critical point p' of f with fj,(p,p') = 1 = p(p ' , q), Si(— oo) = p, 
si(oo) = p' = S 2 ( — oo) , 52 ( 00 ) = q) is contained in the boundary of precisely 
component of M.^ q . 


one 
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Note. Let and s'{ jfs 2 be broken trajectories contained in the boundary 

of the same component of M-p q - It is then possible that sj = s " or s 2 = s 2 , 
but the theorem says that we cannot have both equalities simultaneously. 

Proof. If a component A4 of is compact then it is a 2— dimensional 

manifold that is a smooth family of curves, flow lines from p to q with common 
end points p, q , but disjoint interiors. Thus, such a component is diffeomorphic 
to S 2 . 

If A4 is not compact, Thm. 6.4.1 implies the existence of broken trajec- 
tories from p to q in the boundary of this component. 

Let a be a regular value of / with f{p) > a > f(q). By Lemma 6.4.2, M 
intersects the level hypersurface / _1 (a) transversally, and M. fl/ _1 (a) thus 
is a 1— dimensional manifold. It can thus be compactified by adding one or 
two points. By Thm. 6.4.1, these points correspond to broken trajectories 
from p to q. We thus need to exclude that A4 can be compactified by a 
single broken trajectory s\ffs 2 . We have Si(— oo) = p, s 2 (oo) = q, and we 
put p' := Si(oo) = S 2 ( — oo) . In view of the local normal form provided by 
Thm. 6.3.2, we have the following situation near pt\ A 4^ q is a smooth surface 
containing Si in its interior. q then intersects a smooth 1— dimensional 
family of leaves of the stable foliation near p' in a 1— dimensional manifold. 
The family of those stable leaves intersected by A 4^ q then is parametrized 
by a smooth curve in W u (p') containing p' in its interior. It thus contains the 
initial pieces of different flow lines originating from p in opposite directions, 
and these flow lines are contained in limits of flow lines from A4^ . Therefore, 
in order to compactify Mp q in W u (p'), a single flow line S 2 does not suffice. 


w‘(pO 



Fig. 6.5.1. 


Finally, if a broken trajectory through some p' would be a 2— sided limit of 
M f pq , this again would not be compatible with the local flow geometry near 
p' as just described. □ 
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Definition 6.5.3 Let C* (/, Z 2 ) be the free Abelian group with Z 2 -coefficients 
generated by the set C*(/) of critical points of /. For p £ C*(/), we put 

dp ■= E (#z 2 M^ r ) r 

r€C*(/) 

^(p,r) = l 

where ffz 2 Xi^ r is the number mod 2 of trajectories from p to r (by Lemma 
6.5.4 there are only finitely many such trajectories), and we extend this to a 
group homomorphism 


d : C*(/, Z 2 ) — > C*(/, Z 2 ). 


Theorem 6.5.2 We have 

d o dp = 0, 

and thus (C*(/, Z 2 ),<9) is a chain complex. 

Proof. We have 

d o dp = E E #Z 2 *Mp,r #z 2 M^ q q. 

r€C*U) qec*(f) 
p(p,r) = l p(r,g) = l 

We are thus connecting the broken trajectories from p to g for q £ C*(f) 
with p(p, q ) = 2, by Lemma 6.5.1. By Thm. 6.5.1 this number is always even, 
and so it vanishes mod 2. This implies d o dp = 0 for each p £ and 

thus the extension to C*(/, Z 2 ) also satisfies do d = 0. □ 

We are now ready for 

Definition 6.5.4 Let / be a C 3 function satisfying the Morse-Smale-Floer 
and Palais-Smale conditions, and assume that we have a compact space X 
of trajectories as investigated above. If we are in the situation of an absolute 
Morse index, we let Ck{f, Z 2 ) be the group with coefficient in Z 2 generated 
by the critical points of Morse index k. Otherwise, we choose an arbitrary 
grading in a consistent manner, i.e. we require that if p £ Ck(f ), q £ Ci(f), 
then 

k-l = p(p, q) 

whenever the relative index is defined. We then obtain boundary operators 
5 = ^:C , fc (/,Z 2 )-C fc _ 1 (/,Z 2 ) ) 
and we define the associated homology groups as 

ker dk 


H k (X, /, Z 2 ) : 


image d k+1 
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i.e. two elements a\,a 2 G ker d k are identified if there exists some (3 G 
C k+1 (f,Z 2 ) with 

ai — a 2 = dp. 


Instead of a homology theory, we can also define a Morse-Floer cohomo- 
logy theory by dualization. For that purpose, we put 

C k (f, Z 2 ):=Hom (C k (f, Z 2 ),Z 2 ) 

and define coboundary operators 

(5 fc : C fc (/, Z 2 ) — > C fc+1 (/, Z 2 ) 

by 

(5 fc w fc (p fc+ i) = w fc (9 fc+ ip fc +i) 
for ui k G C k (f,Z 2 ) and p fc+ i G C k (f, Z 2 ). 

If there are only finitely many critical points pi,fe, ..., p m ,k of index fc, then 
we have a canonical isomorphism 

C fe (/,Z 2 )^C fc (/,Z 2 ) 

e- > p k with p k (pi, k ) = 5tj (= 1 for i = j and 0 otherwise) 

and 

S k P k = E p i>fc (%, fe+1 )gf +1 

9i,fe+l critical point of/ of indexfc+1 

provided that sum is finite, too. Of course, this cohomology theory and the 
coboundary operator 5 can also be constructed directly from the function /, 
by looking at the positive instead of the negative gradient flow, i.e. at the 
solution curves of 

y : R — » X 

y(t) =grad f(y(t)) for all t 

The preceding formalism then goes through in the same manner as before. 

Remark. In certain infinite dimensional situations in the calculus of varia- 
tions, there may be an analytic difference between the positive and negative 
gradient flow. Often, one faces the task of minimizing a certain function 
/ : X — » R. that is bounded from below, but not from above, and then also of 
finding other critical points of such a function. In such a situation, flow lines 
for the negative gradient flow 

x{t) = — grad/(ar(t)) 

might be well controlled, simply because / is decreasing on such a flow line, 
and therefore bounded, while along the positive gradient flow 

■y(t) = grad f(y{t)), 

f may not be so well controlled, and one may not be able to derive the 
asymptotic estimates necessary for the analysis. 
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6.6 Orientations and Z-homology 


In the present §, we wish to consider the group C*(/, Z) with integer coeffi- 
cients generated by the set C*(f) of critical points of / and define a boundary 
operator 


satisfying 


d o d = 0 


as in the Z 2 -case, in order that (C*(/, Z),d) be a chain complex. We assume 
that the general assumptions of § 6.5 (/ € C 3 , (PS), nondegeneracy of critical 
points, Morse-Smale-Floer condition) continue to hold. 

We shall attempt to define d as in Def. 6.5.3, by counting the number of 
connecting trajectories between critical points of relative index 1, but now we 
cannot simply take that number mod 2, but we need to introduce a sign for 
each such trajectory and add the corresponding signs ±1. In order to define 
these signs, we shall introduce orientations. 

In order to motivate our subsequent construction, we shall first consider 
the classical case where X is a finite dimensional, compact, oriented, dif- 
ferentiable manifold. Let / : X — > R thus be a Morse function. The index 
p(p) of a critical point p is the number of negative eigenvalues of d 2 f(p), 
counted with multiplicity. The corresponding eigenvectors span the tangent 
space Vp C T p X of the unstable manifold W u {p) at p. We choose an arbi- 
trary orientation of Vp, i.e. we select some basis e 1 , . . . , of Vp as being 
positive. Alternatively, we may represent this orientation by dx 1 A. . .A dx^ p \ 
where dx 1 , . . . , dx are the cotangent vectors dual to e 1 , . . . , e p ^ p \ 

As X is assumed to be oriented, we get an induced orientation of the 
tangent space Vp C T p X of the stable manifold W s (p) by defining a ba- 
sis e A ‘( p ) +1 , . . . ,e n (n = dim A) as positive if e 1 , . . . , e^ p \ e p ^ +1 , . . . , e n is a 
positive basis of T p X. In the alternative description, with dx p ^ +1 , ....... dx n 

dual to e ^ p ) +1 , ... ,e n , the orientation is defined by dx^ p ' >+1 A ... A dx n pre- 
cisely if dx 1 A . . . dx p ^ A dx p ^ +1 A ... A dx n yields the orientation of T p X. 

Now if q is another critical point of /, of index n{q) = fi(p) — 1, we choose 
any regular value a of / with f(q) < a < f(p) and consider the intersection 

W u {p)nW s (q)nf~ 1 {a). 

The orientation of X also induces an orientation of because ,/ _1 (a) 

is always transversal to grad/, and so we can consider a basis rj 2 ,. . . ,r] n of 
T y f~ 1 {a) as positive if grad f{y),rj 2 , . . . , t) n is a positive basis of T y X. 

As we are assuming the Morse-Smale-Floer condition, 

w u (p)nw s (q)nf- 1 (a) 


is a finite number of points by Lemma 6.5.4, and since W u (p) , W s (p) and 
/ _1 (a) all are equipped with an orientation, we can assign the sign +1 or 
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— 1 to any such intersection point depending on whether this intersection is 
positive or negative. 

These intersection points correspond to the trajectories s of / from p to 
q , and we thus obtain a sign 

n(s) = ±1 

for any such trajectory, and we put 

dp := n(s)r. 

r€C,(f) 

V(r) = n(p) — 1 

It thus remains to show that with this definition of the boundary operator 
d , we get the relation 

d o d = 0. 

In order to verify this, and also to free ourselves from the assumptions that X 
is finite dimensional and oriented and to thus preserve the generality achieved 
in the previous §, we shall now consider a relative version. 

We let p , q be critical points of / connected by the flow with 

= 2 , 

and we let M. be a component of M.p q = W u (p)C\ W s (q). For our subsequent 
analysis, only the second case of Thm. 6.5.1 will be relevant, i.e. where A4 
has a boundary which then consists of two different broken trajectories from 
p to q. It is clear from the analysis of the proof of Thm. 6.5.1 that Ad is 
orientable. In fact, Ad is homeomorphic to the open disk, and it contains 
two transversal one-dimensional foliations, one consisting of the flow lines of 
/ and the other one of the intersections of M with the level hypersurfaces 
/ _1 (a), f(q) < a < f(p) (as M does not contain any critical points in its 
interior, all intersections with level hypersurfaces of / are transversal). We 
may thus choose an orientation of Ad . 



Fig. 6.6.1. 
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This orientation then also induces orientations of the corner points of the 
broken trajectories in the boundary of AA in the following sense: Let s = 
s\f)s 2 be such a broken trajectory, with intermediate critical point r = 
si(oo) = S 2 (— oo). The plane in T r X spanned by s'i(oo) := lim t ^ 00 s'i(i) 

and S 2 (— oo) := lirrp ^ .s' 2 (t) then is a limit of tangent planes of AA and 

thus gets an induced orientation from AA. 

This now implies that if we choose an orientation of si, we get an induced 
orientation of 82, by requiring that if Vi, V 2 are positive tangent vectors of si 
and 82, resp. at r, then V\ , V2 induces the orientation of the above plane in 
T r X. Likewise, AA n f~ 1 (a), for f(q) < a < f(p) gets an induced orientation 
from the one of A4 and the one of the flow lines inside A4 which we always 
orient by —grad/. Then the signs n(s 1 ), n(s 2 ) of Si and S 2 , resp. are defined 
by checking whether si resp. S 2 intersects these level hypersurfaces / _1 (a) 
positively or negatively. Alternatively, what amounts to the same is simply 
checking whether si, S 2 have the orientation defined by —grad/, or the op- 
posite one, and thus, we do not even need the level hypersurfaces f^ 1 (a). 

Obviously, the problem now is that the choice of orientation of many 
trajectories connecting two critical points p , r of relative index p(p, r) = 1 
depends on the choice of orientation of some such AA containing s in its 
boundary, and the question is whether conversely, the orientations of these 
AA can be chosen consistently in the sense that they all induce the same 
orientation of a given s. In the case of a finite dimensional, oriented manifold, 
this is no problem, because we get induced orientations on all such AA from 
the orientation of the manifold and choices of orientations on all unstable 
manifolds, and these orientations fit together properly. In the general case, 
we need to make the global assumption that this is possible: 

Definition 6.6.1 The Morse-Smale-Floer flow / is called orientable if we 
may define orientations on all trajectories A4^ } q for critical points p, q with 
relative index p(p, q) = 2 in such a manner that the induced orientations on 
trajectories s between critical points of relative index 1 are consistent. 

With these preparations, we are ready to prove 

Theorem 6.6.1 Assume that the general assumptions (f € C 3 , (PS)), non- 
degeneracy of critical points, Morse-Smale-Floer conditions continue to hold, 
and that the flow is orientable in the sense of Def. 6.6.1. For the group 
Cflf, Z) generated by the set C*(/) of critical points of f, with integer coef- 
ficients, the operator 

d : C.(f,Z) ^ C.(f,Z) 



334 6. Morse Theory and Floer Homology 


defined by 

dp := n(s)r 

P-(p,r) = 1 

for p € C*(/) and linearly extended to C*(f,Z), satisfies 

d o d = 0. 


Thus, C*((/, Z),9) becomes a chain complex, and we may define homology 
groups H k (X,f, Z) in the same manner as in Def. 6.5-4- 

Proof. We have 

d o dp = E E n(s 2 )n(s 1 )q 

qec*(f) r€C*(/) 
mO>9) = 1 M(P.T') = 1 

s l^ M r,p 

= E n(s 2 )n(si)g. 

qec*(f) 
p(p,q )= 2 

(si ,S2) broken trajectory fromp to g 

By Thm. 6.5.1, these broken trajectories always occur in pairs (s 7 !, s' 2 ), 
(s" 1 ,s" 2 ) bounding some component M of M.p q - 
It is then geometrically obvious, see Fig. 6.6.1, that 

n(s[)n(s' 2 ) = -n(si 7 )?i(s 2 )- 

Thus, the contributions of the two members of each such pair cancel each 
other, and the preceding sum vanishes. □ 

In the situation of Thm. 6.6.1, we put 

b k (XJ) := dim z H k (X, /, Z). 

We shall see in §§ 6.7, 6.9 that these numbers in fact do not depend on f. 
As explained at the end of the preceding §, one may also construct a dual 
cohomology theory, with 

C k (f, Z) :=Hom(C fc (/,Z),Z) 

and coboundary operators 

S k : C k (f, Z) -> C k+1 (f, Z) 


with 

5 k u> k (p k+ 1 ) = oj k {d k+ iPk+i) 
for uj k G C k (f,Z),p k+ 1 G C fc+ i(/,Z). 
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6.7 Homotopies 

We have constructed a homology theory for a Morse-Smale-Floer function 
/ on a manifold X, under the preceding assumptions. In order to have a 
theory that captures invariants of X, we now ask to what extent the resulting 
homology depends on the choice of /. To formulate the question differently, 
given two such functions J 1 ,/ 2 , can one construct an isomorphism between 
the corresponding homologies? If so, is this isomorphism canonical? 

A first geometric approach might be based on the following idea, consid- 
ering again the case of a finite dimensional, compact manifold: 

Given a critical point p of / 1 of Morse index /i, and a critical point q of / 2 of 
the same Morse index, the unstable manifold of p has dimension fi, and the 
stable one of q dimension n — p if n = dim X. Thus, we expect that generally, 
these two manifolds intersect in finitely many points xi,...,Xk with signs 
n(xj ) given by the sign of the intersection number, and we might put 

<t> 21 {p)= n ^ q ( 6 - 71 ) 

«60.(/2) ieff»(p)nw*( 9 ) 

(*?)=Myl (p) t f 

(we introduce additional indices f 1 ,/ 2 in order to indicate the source of the 
objects) to get a map 

t 21 :C4f\G)^C*(f 2 ,G) 

extended to coefficients G = Z 2 or Z that hopefully commutes with the 
boundary operators d J , d* in the sense that 

(j) 21 o = d? o (j) 21 . (6.7.2) 

One difficulty is that for such a construction, we need the additional 
assumption that the unstable manifolds for / 1 intersect the stable ones for 
f 2 transversally. Even if / 1 and f 2 are Morse-Smale-Floer functions, this 
need not hold, however. For example, one may consider f 2 = —f 1 ; then for 
any critical point p, 

Wp (p) = W s f2 (p) 

which is not compatible with transversality. 

Of course, one may simply assume that all such intersections are transver- 
sal but that would not be compatible with our aim to relate the homology 
theories for any pair of Morse-Smale-Floer functions in a canonical manner. 
We note, however, that the construction would work in the trivial case where 
/ 2 = f 1 , because then Wp(p) and W s p{p) = Wp (p) intersect precisely at 
the critical point p itself. 

In order to solve this problem, we consider homotopies 
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lim F(x,t) = / 1 (x), lim F(x, t) = f 2 (x) for all x G X. 


In fact, for technical reasons it will be convenient to impose the stronger 
requirement that 

F(x,t) = f 1 (x ) for t < — R 

V “ (6-7.3) 

F(x, t) = f 2 (x) for t > R 

for some R > 0. 

Given such a function F, we consider the flow 


x(t) = — gradF(:r(t),f) forte 
a?(0) = x, 


(6.7.4) 


where grad denotes the gradient w.r.t. the variables. In order to avoid 
trouble with cases where this gradient is unbounded, one may instead consider 
the flow 

x(t) = — -gradF(x(t),t), (6.7.5) 

V 1 + I ^ I Igrad F\2 

but for the moment, we ignore this point and consider (6.7.4) for simplicity. 

If p and q are critical points of / 1 and / 2 , resp., with index p the strategy 
then is to consider the number of flow lines s(t) of (6.7.5) with 

s{-oo)=p , s(oo) = q, 

equipped with appropriate signs n(s), denote the space of these flow lines by 
■Mp q , and put 


0 21 (t) = n ^ q - 

qec,(f 2 ) s£M !' 0 
m(q)=m(p) 

Let us again discuss some trivial examples: 

If / 1 = f 2 and F is the constant homotopy, then clearly 

(j> 21 (p) =p, 

for every critical point p. If / 2 = — f 1 and we construct F by 


(6.7.6) 


we have 

for any flow line. Thus, also 


/V) 

for 

—oo < t < —1 



for 

-1 < t < 1 

(6.7.7) 

-f\x) 

for 

1 < t < 00 


s(t) = 

s{-t) 


(6.7.8) 
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s(oo) = s(— oo), 

and a flow line cannot connect a critical p of f 1 of index p/ with a critical 
point q of / 2 of index fJ = n — p? , unless p = q and pf = f • Consequently, 
we seem to have the same difficulty as before. This is not quite so, however, 
because we now have the possibility to perturb the homotopy if we wish to 
try to avoid such a peculiar behavior. In other words, we try to employ only 
generic homotopies. 

In order to formulate what we mean by a generic homotopy we recall the 
concept of a Morse function. There, we required that the Hessian d 2 f(x o) 
at a critical point is nondegenerate. At least in the finite dimensional case 
that we consider at this moment, this condition is generic in the sense that 
the Morse functions constitute an open and dense subset of the set of all C 2 
functions on X. The Morse condition means that at a critical point Xq, the 
linearization of the equation 

x{t) = -grad f(x(t)) 

has maximal rank. A version of the implicit function theorem then implies 
that the linearization of the equation locally already describes the qualitative 
features of the original equation. In this sense, we formulate 

Definition 6.7.1 The homotopy F satisfying (6.7.3) is called regular if when- 
ever 

grad F(xo, t) = 0 for all t € R, 

the operator 

^ + d 2 F(x 0 ,t ) : H 1,2 { X qTX) - L 2 (xqTX) 

is surjective. 

This is satisfied for a constant homotopy, if / 1 is a Morse function, but 
not for the homotopy (6.7.7) because in that case only sections satisfying 
(6.7.8) are contained in the range of + d 2 F(xo,t). 

Let us continue with our heuristic considerations: 

If / 1 is a Morse function as before, (p : ( — oo, 0] — > R + satisfies ip(t) = 1 for 
t < — 1, </?(0) = 0, we consider the flow 

x(t) = — (p(t)gra,d f 1 (x(t)) for — oo < t < 0, 
x(0) = x. 

We obtain a solution for every x £ X, and as before x(— oo) always is a 
critical point of f 1 . Thus, while all the flow lines emanate at a critical point 
for t = —oo, they cover the whole manifold at t = 0. If we now extend ip to 
(0, oo) by putting 


i p(t) := ip(—t) for t > 0, 
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and if we have another Morse function f 2 and put 

x(t) = — <p(f)grad f 2 (x(t)) for t > 0, 

in the same manner, the flow lines will converge to critical points of / 2 at 
t = oo. We thus relate the flow asymptotic regimes governed by / 1 and f 2 
through the whole manifold X at an intermediate step. Of course, this only 
works under generic conditions, and we may have to deform the flow slightly 
to achieve that, but here we rather record the following observation: The 
points x(0) for flow lines with x{— oo) = p cover the unstable manifolds of 
the critical point p of f 1 , and likewise the points x(0) for the flow lines with 
x(oo) = q for the critical point q of f 2 cover the stable manifold of q. Thus 
the flow lines with x{— oo) = p, x(oo) = q correspond to the intersection of 
the unstable manifold of p (w.r.t. f 1 ) with the stable manifold of q (w.r.t. 
/ 2 ), and we now have the flexibility to deform the flow if problems arise from 
nontransversal intersections . 

Let us return once more to the trivial example f 1 = f 2 , and a constant 
homotopy F. We count the flow lines not in X, but in IxR. This simply 
means that in contrast to the situation in previous §§, we now consider the 
flow lines x(-) and x(- + to), for some fixed to € R, as different. Of course, 
if the homotopy F is not constant in t, the time shift invariance is broken 
anyway, and in a certain sense this is the main reason for looking at the 
nonautonomous equation (6.7.4) as opposed to the autonomous one x(t) = 
—grad /(x(t)) considered previously. Returning for a moment to our constant 
homotopy, if p and q are critical points of indices p.(p) and p(q) = p(p) — 1, 
resp. , connected by the flow of f [ , the flow lines for F cover a two-dimensional 
region in Ixl. This region is noncompact, and it can be compactified by 
adding broken trajectories of the type 

si#s 2 

where Si is a flow for / 1 from p to q and s 2 is the constant flow line for 
/ 1 = f 2 from p to q. This looks analogous to the situation considered in 
§6.5, and in fact with the same methods one shows the appropriate analogue 
of Thm. 6.5.1. When it come to orientations, however, there is an important 
difference. Namely, in the situation of Fig. 6.7.1 (where we have compactified 
R to a bounded interval), the two broken trajectories from p to q in the 
boundary of the square should now be given the same orientation if we wish 
to maintain the aim that the homotopy given through (6.7.6) commutes with 
the boundary operator even in the case of coefficients in Z. 
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Fig. 6.7.1. 

The considerations presented here only in heuristic terms will be taken up 
with somewhat more rigour in § 6.9 below. 


6.8 Graph flows 

In this §, we shall assume that X is a compact, oriented Riemannian manifold. 
A slight variant of the construction of the preceding § would be the following: 
Let /i, fi be two Morse-Smale-Floer functions, as before. In the preceding §, 
we have treated the general situation where the unstable manifolds of f\ need 
not intersect the stable ones of /2 transversally. The result was that there 
was enough flexibility in the choice of homotopy between and /2 so that 
that did not matter. In fact, a consequence of that analysis is that we may 
always find a sufficiently small perturbation of either one of the two functions 
so that such a transversality property holds, without affecting the resulting 
algebraic invariants. 

Therefore from now on, we shall assume that for all Morse-Smale-Floer 
functions /i,/ 2 ,--- occuring in any construction in the sequel, all unstable 
manifolds of any one of them intersect all the stable manifolds of all the other 
functions transversally. We call this the generalized Morse-Smale-Floer 
condition. 

Thus, assuming that property, we consider continuous paths 

x : R. — > X 


with 


x(t) = —grad with i = 1 for t < 0, i = 2 for t > 2. 

The continuity requirement then means that we are switching at t = 0 in a 
continuous manner from the flow for fi to the one for / 2 . As we are assuming 
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the generalized Morse-Smale-Floer condition, this can be utilized in the man- 
ner described in the previous § to equate the homology groups generated by 
the critical points of /i and fa resp. 

This construction admits an important generalization: 

Let r be a finite oriented graph with n edges, ni of them parametrized by 
(— 00 , 0], n 2 parametrized by [0, oo), and the remaining ones by [0, 1]. We also 
assume that to each edge e, of T, there is associated a Morse-Smale-Floer 
function /, and that the generalized Morse-Smale-Floer condition holds for 
this collection fi, ... . f n . 

Definition 6.8.1 A continuous map x : T — > X is called a solution of the 
graph flow for the collection (/i, . . . , /„) if 

x(t) = —grad for t € e*. (6.8.1) 


Again, the continuity requirement is relevant only at the vertices of r as 
the flow is automatically smooth in the interior of each edge. If pi, . . . ,p ni 
are critical points for the functions resp. corresponding to the 

edges ei, . . . , e ni parametrized on (— oo, 0], p ni + 1 , . . . ,p ni +n 2 critical points 
corresponding to the edges e ni +i, . . . , e ni +„ 2 resp. parametrized on [0, oo), 
we let M.p lt ... )P + be the space of all solutions of (6.8.1) with 

lim x{t) = pi for i = 1, , ni 

teei 

lim x{t) = Pi for i = n\ + 1, . . . , ni + n 2 , 

i.e. we assume that on each edge e*, i = 1, . . . ,ni + ri 2 , x{t) asymptotically 
approaches the critical p t of the function fi . 

If X is a compact Riemannian manifold of dimension d, we have 

Theorem 6.8.1 Assume, as always in this §, the generalized Morse-Smale- 
Floer condition. Then p„ 1 +„ 2 a srno °ldi manifold, for all tuples 

(pi, . . . ,p n . 1+7l2 ), where pi is a critical point of fi, with 

ni ni+ri2 

dim Mp u _ iPni+n2 =J2t l (Pi) - viPj) - d{rn - 1) - ddimH^R), 

i = 1 1 

(6.8.2) 

where p(pk) is the Morse index of the critical point pu for the function fk- 

Proof. We simply need to count the dimensions of intersections of the relevant 
stable and unstable manifolds for the edges modeled on [0, 00 ) and (— 00 , 0] 
and the contribution of internal loops. Each unstable manifold corresponding 
to a point Pi, i = n\ + 1, . . . , n\ + ri 2 has dimension d— p(pi). If a submanifold 
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Xi of X is intersected transversally by another submanifold X 2 , then the 
intersection has dimension d— (d — dim A'i) — (d — dim-X^), and this accounts 
for the first three terms in (6.8.2). If we have an internal loop in T, this 
reduces the dimension by d, as the following argument shows: 

Let r be constituted by two ei,e 2 with common end points, and let the 
associated Morse functions be resp. For /j, i = 1,2, we consider the 

graph of the flow induced by that function, i.e. we associate to each x £ X 
the point x*( 1), where Xi is the solution of ii(t) = — grad fi(xi(t)), Xj(0) = x. 
These two graphs for /i and f 2 are then submanifolds of dimension d of 
X x X, and if they intersect transversally, they do so in isolated points, as 
dim(X x X) = 2d. Thus, if we start with a d-dimensional family of initial 
points, we get a finite number of common end points. □ 

Again Xlp 1J>n +n is not compact, but can be compactified by flows 
with broken trajectories on the noncompact edges of F. 

The most useful case of Thm. 6.8.1 is the one where the dimension of 
„ is 0. In that case, „ consists of a finite number 

of continuous maps x : T — > X solving (6.8.1) that can again be given 
appropriate signs. The corresponding sum is denoted by 

n{r-p 1 ,...p ni+n2 ). 


We then define a map 

n i 

q(r) : <g> C*(/i, Z) 


a(/„z) 


i = l j=m ~\- 1 

(Pl ® • • • ®Pni) ^ n(r-,P!, . . . ,p ni+n2 )(p ni + 1 (g) . . . (g )p ni+7l2 ). 


With 

Z) := Hom(C*(fi, Z),Z), 
we may consider q(r) as an element of 

ni ni+n2 

® C*(fi, Z) (g) Z). 

*= 1 j=n 1+1 

With the methods of the previous §, one verifies 

Lemma 6.8.1 dq = 0. 

Consequently, we consider q(r) also as an element of 

ni ni+n-2 

® ® H*(/,;Z). 

*=l j=m+l 

Besides the above example where T had the edges (— oo,0] and [0, oo), 
there are other examples of topological significance: 
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r = [0, oo). Thus, TZi = 0, ri 2 = 1, and with p = p ni = pi, 
dim = d — p(p). 

This is 0 precisely if p{p) = d, i.e. if p is a local maximum. In that 
case q(r ) £ Hd(X\ Z) is the so-called fundamental class of X. 
r consisting of two edges modeled on (— oo,0], and joined by identi- 
fying the two right end points 0. Thus ?ii = 2, n 2 = 0, and 

dim Mp i p2 = /i(pi) + p(p 2 ) - d, 

and this is 0 if p{p 2 ) = d — p{p\). With k := p{pi), thus 

q (r) e H k (x, z) ® H d ~ k (x, z) 

^ Horn {H k (X; Z); H d ~ k (X,Z) 
is the so-called Poincare duality isomorphism. 

r consisting of one edge modeled on (— oo,0], and two ones modeled 
on [0,oo), all three identified at the common point 0. Thus n i = 
1, n 2 = 2, and 

dim Mp ltP2tP3 = fi(pi) - fi(p 2 ) - p(p 3 ). 

Hence, if this is 0, 

q(r) £ ® H k ( K , Z) ® Hj (X, Z ) ® H k _j (X, Z) 

j<k 

~ ® Horn (H j (X,Z) ® H k ~ j {X,Z),H k {X,Z). 

j<k 

We thus obtain a product 

U : H j (X,Z)®H k ~ j (X,Z) -> H k (X,Z), 
the so-called cup product. 

r consisting of one edge (— oo, 0] together with a closed loop based at 
0. In that case 

dim Alp = p{p) — d , 
which vanishes for p{p) = d , i.e. 

q(r) £H d (X,Z). 

This cohomology class is called the Euler class. 
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We are considering solution curves of 


x(t ) + grad f(x(t)) = 0, (6.9.1) 

or more generally of 

x(t) + gr&dF(x(t),t) = 0, (6.9.2) 

and we wish to assign a sign to each such solution in a consistent manner. 

For that purpose, we linearize those equations. We consider a curve x(t) of 
class H 1,2 {R,X) and a section ip(t) of class H 1,2 of the tangent bundle of X 
along x, i.e. tp £ H 1,2 (l R, x*TX). Then, in the case of (6.9.1), the linearization 
is 

V_d_ ^(exp x (t)sip(t))* + grad /(exp x(t) sv?(t))) _ q 

= V_d <p(t) + D v{t) grad f(x(t)) with Vd := V i(t) , 

V the Levi-Civita connection of X, 


and likewise, for (6.9.2), we get 

V £¥>(*) + D ipW gradF(x(t),t). 

We shall thus consider the operator 

Vi + -Dgrad F : H^ 2 {x*TX) - L 2 (x*TX ) 

(6.9.3) 

<p e-> ViV? + H^grad F. 

This is an operator of the form 

V + A : H 1 ' 2 (x*TX) -4 L 2 {x*TX), 

where A is a smooth section of x*EndTX which is selfadjoint, i.e. for each 
ieK, A(t) is a selfadjoint linear operator on T x ( t> X . 

We are thus given a vector bundle E on ]R and an operator 

V + A : H 1,2 (E) -> L 2 (E), 

with A a selfadjoint endomorphism of E. H 1 ' 2 (E) and L 2 (E) are Hilbert 
spaces, and V + A will turn out to be a Fredholm operator if we assume that 
A has boundary values H(±oo) at ±oo. 

Let L : V — > W be a continuous linear operator between Hilbert spaces 
V,W, with associated norms || • ||y, || • \\w resp. (we shall often omit the 
subscripts y,w and simply write || • || in place of || • ||y or || • ||w)- L is called 
a Fredholm operator iff 

(i) Vq := ker L is finite dimensional 



344 6. Morse Theory and Floer Homology 


(ii) W\ := LfV), the range of L, is closed and has finite dimensional 
complement Wq =: coker L , i.e. 

W= Wi© W 0 . 


From (i), we infer that there exists a closed subspace V\ of V with 

V = V 0 ®V U 

and the restriction of L to Vi is a bijective continuous linear operator L~ x : 
Vi -► W\. 

By the inverse operator theorem, 

L~ x : VFi -> Vi 

then is also a bijective continuous linear operator. We put 

ind L := dim Vq — dim Wq 

= dim ker L — dim coker L. 

The set of all Fredholm operators from V to W is denoted by F(V, W ). 

Lemma 6.9.1 F(V, W) is open in the space of all continuous linear operators 
from V to W, and 

ind : F(V, W) -> Z 

is continuous, and therefore constant on each component of F(V,W). 

For a proof, see e.g. J.Jost, X. Li-Jost, Calculus of variations, Cambridge 
University Press, 1998. 

By trivializing E along R, we may simply assume E = R ra , and we thus 
consider the operator 

j t +A(t) : H 1,2 (R,W n ) -> L 2 (R,R"), (6.9.4) 

and we assume that Aft) is continuous in t with boundary values 

^4(±oo) = lim Aft), 

t — >-=hoo 

and that A{— oo) and A(oo) are nondegenerate. In particular, since these 
limits exists, we may assume that 

\\Aft)\\ < const., 

independently of t. For a selfadjoint B G Gl(n,R), we denote by 

KB) 

the number of negative eigenvalues, counted with multiplicity. 
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Lemma 6.9.2 La := fjf + Aft) : U 1,2 (R,R n ) — » L 2 (R,R n ) is a Fredholm 
operator with 

ind La = n(A(— oo)) — p(A( oo)). 

Proof. We may find a continuous map C : R — > Gl(n, R) and continuous 
functions Ai(t), . . . , A„(t)) such that 

Cft)~ l A{t)C{t ) = diag (Ai(i), . . . , A„(t)), Ai (t) < A 2 (<) < •■■ < A„(t), 

i.e. we may diagonalize the selfadjoint linear operators A(f) in a continuous 
manner. By continuously deforming N(t) (using Lemma 6.9.1), we may also 
assume that Aft) is asymptotically constant, i.e. there exists T > 0 with 

Aft) = A{— oo) for t < — T 
Aft) = A(oo) for t >T. 

Thus, Cft), Ai(t), . . . , X n ft) are also asymptotically constant. If s(t) is in 
H 1 ’ 2 , then it is also continuous, and hence if it solves 

j f s{t) + Aft)s{t) = 0, 

then it is also of class C 1 , since = —A(t)s(t) is continuous. On 

(—oo, — T\, it has to be a linear combination of the functions 

e -\i(-oo)t, 

and on [T, oo), it is a linear combination of 

g — Ai(o°)t, i = 

Since a solution on [— T, T ] is uniquely determined by its values at the bound- 
ary points ±T, we conclude that the space of solutions is finite dimensional. 
In fact, the requirement that s be in H 1,2 only allows linear combinations 
of those exponential functions of the above type with A$(— oo) < 0, on 
(—oo, — T), and likewise we get the condition Aj(oo) > 0. Thus 

dimkerL^ = max(^(A(— oo)) — /x(A(oo)), 0) 

is finite. 

Now let a £ L 2 (R,R") be in the orthogonal complement of the image of 
L a , i.e. 

■a(t)dt = 0 for all s G i? 1,2 (R, R"), 

where the denotes the Euclidean scalar product in R n . In particular, this 
relation implies that the weak derivative icrft) equals —A(t)a(t), hence is in 
L 2 . Thus a £ U 1,2 (R,R") is a solution of 


/ (fpd) + mm) 
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j f a(t) - A(t)a(t ) = 0. 

In other words, La has — L_a as its adjoint operator, which then by the 
above argument satisfies 

dimkerL_A = max(/i(— A(— oo)) — /i(— A(oo)), 0) 

= max(/x(A(oo)) — fJ,(A(— oo)), 0). 

La then has as its range the orthogonal complement of the finite dimensional 
space ker L_a, which then is closed, and 

ind La = dim ker La — dim coker La 
= dim ker La — dim ker L_ a 
= n(A(-oo)) - n{A{ oo)). 

□ 

Corollary 6.9.1 Let xi,x 2 be H 1,2 curves in X, Ei vector bundles along 
Xi, A.i continuous selfadjoint sections of End-Ej, i = 1,2, with Xi(oo) = 
X2 ( — oo) , Ei(oo) = E 2 ( — oo), ^ 4 i(oo) = A 2 (— oo). We assume again that 
A\(— oo), Ai(oo) = A 2 (— oo), ^2(00) are nondegenerate. We consider diffeo- 
morphisms 

<7 1 : (—00, 0 ) — > K, cr 2 : ( 0 , 00) — > M, 

with <J t (t) = t for |t| > T for some T > 0, i = 1,2, and consider the curve 

{ X\ (cr 1 (t)) for t < 0 

xi(oo) = X 2 (— 00 ) for t = 0 
* 2 (c 2 (t)) for f > 0 

with the corresponding bundle Eft) and Aft ) glued together from E\, E 2 , 
A 1 , A 2 f resp. in the same manner. Then 

ind La = ind La x + ind La 2 ■ 


Proof. 

ind Lai + ind La 2 = fi(Ai(—oo)) — /x(^4i(oo)) + n(A 2 (— 00 )) — p(A 2 ( 00 )) 

= h(A(-oo)) - n(A(o o)) 

= ind La, by Lemma 6.9.2 and construction. 

□ 

We now need to introduce the notion of the determinant of a Fredholm 
operator. In order to prepare that definition, we first let V, W be finite di- 
mensional vector spaces of dimension ?n, equipped with inner products, and 
put 
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Det V := A m (V), with A°V := R. 

Then (Det V)* ® Det V is canonically isomorphic to R via v* ® vj i— > v*(w). 
A linear map 

/ : V -> W 

then induces 


det l : Det V — > Det W, 
i.e. 

det / e (Det V)* 0 Det W. 

The transformation behavior w.r.t. bases ei, . . . , e m of V, fi , . . . , f m of W is 
given by 

det l(e i A ... A e m ) = lei A . . . A le m —■ Afi A ... A f m . 


We may e.g. use the inner product on W to identify the orthogonal comple- 
ment of l(V) with coker /. The exact sequence 

0 — > ker l — > V W — * coker l — »■ 0 


and the multiplicative properties of det allow the identification 

(Det V)* ® Det W = (Det ker/)* ® Det (coker l) =: Det l. 

This works as follows: 

Put Vo = ker Z, Wq = coker L (= /(P)- 1 ), and write V = Vo ® V), W = 
Wo ® Wi. Then 

h '■= l\Vx '■ Vi — >■ Wi, 

is an isomorphism, and if ei, . . . , ej, is a basis of Vo, e^+i, . . . e m one of Vi, 
fi , . . . fk one of Wo, and if we take the basis Zefc+i, . . . , le m of Wi, then 

(ei A . . . e k A e k+1 A ... A e m )* 0 (/i A . . . A f k A le k+1 A ... A le m ) 

is identified with 

(ei A ... A e m ) ® (/i A ... A fm')- 

According to the rules of linear algebra, this identification does not depend 
on the choices of the basis. In this manner, we obtain a trivial line bundle 
over V* ® W, with fiber (Det V)* 0 Det W = (Det ker/)* ® Det coker / over 
/. det / then is a section of this line bundle, vanishing precisely at those / that 
are not of maximal rank m. On the other hand, if / is of maximal rank, then 
(Det ker /)* ® Det coker / can be canonically identified with R, and det / with 
1 € R, by choosing basis ei, . . . , e m of V and the basis le i, . . . le m of W, as 
above. 

In a more abstract manner, this may also be derived from the above exact 
sequence 

0 — > ker / — > V W — > coker / — » 0 
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on the basis of the following easy algebraic 

Lemma 6.9.3 Let 0 — > Vi V 2 • • • ~^Vk — > 0 be an exact sequence of 
linear maps between finite dimensional vector spaces. Then there exists a 
canonical isomorphism 

© A max Vi ^ <g> M max K;. 

i odd i even 


One simply uses this Lemma plus the above canonical identification 
(Det V)* © Det V =* K. 

Suppose now that V, W are Hilbert spaces, that Y is a connected topolog- 
ical space and that l y £ F(V. W) is a family of Fredholm operators depending 
continuously on y £ Y. Again, we form the determinant line 

Det l y := (Det ker/ y )* (g ) (Det coker l y ) 

for each y. We intend to show that these lines (Det l y ) y eY constitute a line 
bundle over Y. 

l y : (ker ly) 1 - — » (coker ly) 1 - 
v 1 — > l y v 

is an isomorphism, and 

ind l y = dim ker l y — dim coker l y 

is independent of y £ Y, as Y is connected. For y in a neighborhood of some 
2/o € Y, let Vy C V be a continuous family of finite dimensional subspaces 
with ker l y C V y for each y, and put 

Wy := l y (Vy) © coker l y . 

Then as above 

(Det Vy)* © Det W' y = (Det ker l y )* © Det coker l y . 

The point now is that this construction is independent of the choice of V y in 
the sense that if V” is another such family, we get a canonical identification 

(Det V”)* © Det W” S (Det V^)* © Det W'. 

Once we have verified that property, we can piece the local models (Det 1©)*© 
Det Wy for Det l y unambiguously together to get a line bundle with fiber 
Det l y over y on Y. 

It suffices to treat the case 


V' C V" 

Vy y- V y , 


and we write 
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y'y = K © v y , 

and 

W v = W 'y® W v 

l y V y Wy is an isomorphism, and 

det l y : Det V y — > Det W y 

yields a nonvanishing section Ai y of (Det V y )* ® Det W y . We then get the 
isomorphism 

(Det V')* <g> Det W' y -► (Det V')* ® Det Wy <g> (Det tg* ® Det Wy 
^ (Det ¥")* ® (Det W'') 

and this isomorphism is canonically determined by l y . 

We have thus shown 

Theorem 6.9.1 Let (l y ) y eY C F(V,W) be a family of Fredholm operators 
between Hilbert spaces V, W depending continuously on y in some connected 
topological space Y. Then we may construct a line bundle over Y with fiber 

Det ly = (Det ker^)* ® (Det coker l y ) 

over y, and with a continuous section det l y vanishing precisely at those y GY 
where ker l y Y 0. 

Definition 6.9.1 Let l = (l y )( y eY) C Fty,W) be a family of Fredholm 
operators between Hilbert spaces V, W depending continuously on y in some 
connected topological space Y . An orientation of this family is given by a 
nowhere vanishing section of the line bundle Det l of the preceding theorem. 

If ker l y = 0 for all y G Y, then of course det l y yields such a section. If 
this property does not hold, then such a section may or may not exist. 

We now wish to extend Cor. 6.9.1 to the determinant lines of the operators 
involved, i.e. we wish to show that 

det La — det La x ® det La 2 ■ 

In order to achieve this, we need to refine the glueing somewhat. We again 
trivialize a vector bundle E over R, so that E becomes R x R n . Of course, one 
has to check that the subsequent constructions do not depend on the choice 
of trivialization. 

We again consider the situation of Cor. 6.9.1, and we assume that Ai, A 2 
are asymptotically constant in the sense that they do not depend on t for 
|t| > T, for some T > 0. For r G R, we define the shifted operator L t Ai via 
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LaA 1 ) = fff + A i(t-T)s(t). 

As we assume A\ asymptotically constant, A[(t) := Ai(t+r) does not depend 
on t over [— l,oo) for r sufficiently large. Likewise, Af T (t) does not depend 
on t over (— 00 , 1] for r sufficiently large. We then put 


A(t) Aiff T A 2 (t) 


Ai(t + r) forfG (—oo,0] 
A 2 (t — t ) for t £ [0, 00 ) 


and obtain a corresponding Fredholm operator 


l a 1 # t a 2 - 


Lemma 6.9.4 For r sufficiently large, 


Det Lai# t a 2 — Det L Al (g> Det La 2 - 


Sketch of Proof . We first consider the case where L Al and La 2 are surjective. 
We shall show 


dimkerL^ < dimkerL^ +dimkerL J 4 2 (6.9.5) 

which in the surjective case, by Cor. 6.9.1 equals 

ind La 1 + ind La 2 = ind La 

< dim ker La , 


hence equality throughout. 

Now if s T (t) £ ker L Ai # t a 2 , we have 

+ A(t)s T (t) = 0, (6.9.6) 

and we have 

A(t) = Ai(oo)(= A 2 (-oo)) 

for |t| < r, for arbitrarily large T, provided r is sufficiently large. Since Ai(oo) 
is assumed to be nondegenerate, the operator 


d 

dt 


Ai(oo) 


is an isomorphism, and thus, if we have a sequence 


($T n )neN 

of solutions of (6.9.6) for r = r n , with ||s Tn ||#i ,2 < 1, r n — > 00 , then 


0 on [—T,T], for any T > 0. 
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On the other hand, for t very negative, we get a solution of 
^ s T (t ) + Ai(-oo)s t (<) = 0, 

or more precisely, s T (t — r) will converge to a solution of 

+ Ai(f)s(i) = 0, 
clt 

i.e. an element of kerL^. Likewise s T (t + r) will yield an element of kerL^. 
This shows (6.9.5). 

If La i, La 2 are not necessarily surjective, one finds a linear map A : — > 

L 2 (K,]R") such that 

L Ai +A: H 1,2 (R, R") x R k -> L 2 (K,K”) 

(s, v ) e- > + Av 

are surjective for i = 1,2. One then performs the above argument for these 
perturbed operators, and observes that the corresponding determinants of 
the original and the perturbed operators are isomorphic. □ 

We now let Y be the space of all pairs (x,A), where x : R. — > X is a 
smooth curve with limits x(±oo) = lim^-too x(t) £ X, and A is a smooth 
section of a;*EndTW for which Aft) is a selfadjoint linear operator on T x t t )X, 
for each ieR, with limits ^4(±oo) = lim^-too Aft) that are nondegenerate, 
and for each y £ ( x , A) £ Y, we consider the Fredholm operator 

L (x , A ) ■= V + A : H 1,2 (x*TX) -> L 2 {x*TX) 

Lemma 6.9.5 Suppose X is a finite dimensional orient-able Riemannian 
manifold. Let (xi,A{), (zi,^) £ Y satisfy a>i(±oo) = ^(±00), ^ 4 i(±oo) = 
^(± 00 ). Then the determinant lines Det L^ Xl Al ) omd Det L( X2 a 2 ) can be 
identified through a homotopy. 

Proof. We choose trivializations cr; : x*TX ->tx K™ (n = dim X) extend- 
ing continuously to ±oo, for i = 1,2. Thus, L/ Xi ^A i ) is transformed into an 
operator 

L a . = — + AAt) : H 1,2 (R,R n ) -> L 2 (M,K") 

* dt 

(with an abuse of notation, namely using the same symbol Ajft) for an en- 
domorphism of T x ( t )X and of K" = (Ji{t)(T x t t ^X)). Since X is orientable, we 
may assume that 

<Ti(±oo) = a 2 (Too) 

(for a nonorientable X, we might have cri(— oo) = er 2 (~ oo), but <Ti(oo) = 
— 0 - 2 ( 00 ), or vice versa, because GL(n,M) has two connected components, but 
in the orientable case, we can consistently distinguish these two components 
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acting on the tangent spaces T x X with the help of the orientations of the 
spaces T x X). Thus, the relations Ai(±oo) = A 2 (±oo) are preserved under 
these trivializations. 

From the proof of Lemma 6.9.2, ind L Al = ind La 2 , and coker L Ai = 0 
or kerL^i = 0, depending on whether ±/i(Aj(— oo)) > ±/z(Aj(oo)). It then 
suffices to consider the first case. Since the space of all adjoint endomorphisms 
of R" can be identified with R. 2 (the space of symmetric (nxn) matrices), 
we may find a homotopy between Ai and A 2 in this space with fixed endpoints 
Ai(±oo) = A 2 (Too) . As a technical matter, we may always assume that 
everything is asymptotically constant as in the proof of Lemma 6.9.2, and 
that proof then shows that such a homotopy yields an isomorphism between 
the kernels of La 1 and La 2 ■ □ 

Thus, Fredholm operators with coinciding ends at ±00 as in Lemma 6.9.5 
can be consistently oriented. Expressed differently, we call such operators 
equivalent, and we may define an orientation on an equivalence class by 
choosing an orientation of one representative and then defining the orien- 
tations of the other elements of the class through a homotopic deformation 
as in that lemma. 

Definition 6.9.2 An assignment of an orientation a(x,A ) to each equiva- 
lence class ( x,A ) is called coherent if it is compatible with glueing, i.e. 

a((xi,A 1 )ff(x 2 , A 2 )) = a(xi,Ax) ® er(;r 2 , A 2 ) 

(assuming, as always, the conditions required for glueing, i.e. xi(oo) = 
x 2 {— 00 ), Ai(oo) = A 2 (— 00 )). 


Theorem 6.9.2 Suppose X is a finite dimensional orientable Riemannian 
manifold. Then a coherent orientation exists. 

Proof. We first consider an arbitrary constant curve 
x(t) = xq € X, A(t) = A 0 . 

The corresponding Fredholm operator 

L Ao = j t + A 0 : ff 1,2 (R, T Xo X) - L 2 (T Xo X) 

then is an isomorphism by the proof of Lemma 6.9.2, or an easy direct argu- 
ment. Thus, Det La 0 is identified with R®R*, and we choose the orientation 
1 ® 1* We next choose an arbitrary orientation for each class of 

operators L^ x a) different from L^ Xq a 0 ) with 


x(— 00 ) = xq, A{— 00 ) = Aq 
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(note that the above definition does not require any continuity e.g. in A(oo)). 
This then determines orientations for classes of operators L( x ,a) with 

x(oo) = xq, A(oo) = Aq, 

because the operator Lr x -i, A -i\, with x _1 (f) := x(—t), A _1 (t) := A(—t), 
then is in the first class, and 

L(x-\a-')#L( X}A) is equivalent to L( Xo ,a o) , 

and by Lemmas 6.9.4 and 6.9.5 

Det L( x - i,a-i) ® Det L( Xi a) = Det L( Xo ,a 0 )- 

Finally, for an arbitrary class L( x a), we find (xi, A^) and (x 2 ,A 2 ) with 

x\{— 00) = xo, A\(— 00) = xq, xi(oo) = x{— oo), Ai(oo) = A{— 00), 

£2(00) = xq, A 2 ( 00) = x 0 , x 2 {— 00) = x(oo), A 2 {— 00) = A(oo). 

and the glueing relation 

L (x 1 ,A 1 )#L( x , A )#L( X2t A 2 ) equivalent to L (x0i a 0 )- 
The relation of Lemma 6.9.4, i.e. 

Det L^ XlAl ^ 0 Det L^ X} a) 0 Det L( X2 i a 2 ) = Det L^ Xo Ao ^ 
then fixes the orientation of L^ xA y □ 

We shall now always assume that X is a compact finite dimensional, 
orientable Riemannian manifold. According to Thm. 6.9.2, we may assume 
from now on that a coherent orientation on the class of all operators L( x ,a) 
as above has been chosen. 

We now consider a Morse-Smale-Floer function 

/ : X -► M 

as before, and we let p, q G X be critical points of / with 

Mp) - M?) = 1- 

Then for each gradient flow line x(t) with x(— 00) = p, x(oo) = q , i.e. 

x(t) + grad/(x(t)) = 0, 

the linearization of that operator, i.e. 

L := V x(t) + d 2 f(x(t )) : H 1,2 (x*TX) - L 2 {x*TX) 

is a surjective Fredholm operator with one-dimensional kernel, according to 
Lemma 6.9.2 and its proof. However, we can easily find a generator of the 
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kernel: as the equation satisfied by x{t) is autonomous, for any r 0 £ K, 
x(t + r) likewise is a solution, and therefore x(t) must lie in the kernel of 
the linearization. Altogether, x(t ) defines an orientation of Det L , called the 
canonical orientation. 

Definition 6.9.3 We assign a sign n(x(t )) = ±1 to each such trajectory of 
the negative gradient flow of / with fi(x(— oo)) — n(x(oo)) = 1 by putting n = 
1 precisely if the coherent and the canonical orientation for the corresponding 
linearized operator V + d 2 f coincide. 

This choice of sign enables us to take up the discussion of § 6.6 and define 
the boundary operator as 

dp= ^2 n(s)r, 

rec*(f) 
m(t') = A‘(p)- 1 

SGM l,r 

now with our present choice of sign. Again, the crucial point is to verify the 
relation 

<9 2 = 0. 

As in Thm. 6.5.1, based on Thm. 6.3.1, we may again consider a component 
M of M f p q (p, q critical points of / with p(p) — p{q) = 2), homeomorphic to 
the open disk. We get a figure similar to Fig. 6.6.1 


p 



Fig. 6.9.1. 

On the flow line x(t) from p to q , we have indicated a coherent orientation, 
chosen such that e\ corresponds to the negative flow line direction, and e -2 cor- 
responds to an arbitrarily chosen orientation of the one-dimensional manifold 
/ _1 (a) 0 Ad, where f{q) < a < f(p ), as in § 6.6. The kernel of the associ- 
ated Fredholm operators L x is two-dimensional, and e\ A e-i then induces an 
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orientation of Det L x . The coherence condition then induces corresponding 
orientations on the two broken trajectories from p to q, passing through the 
critical points ri,r% resp. In the figure, we have indicated the canonical ori- 
entations of the trajectories from p to rq and r 2 and from r± and r 2 to q. Now 
if for example the coherent orientations of the two trajectories from p to rq 
and r 2 , resp. both coincide with those canonical orientations, then this will 
take place for precisely one of the two trajectories from rq and r 2 resp. to q. 
Namely, it is clear now from the figure that the combination of the canoni- 
cal orientations on the broken trajectories leads to opposite orientations at 
q , which however is not compatible with the coherence condition. From this 
simple geometric observation, we infer the relation do d = 0 as in § 6.6. 

We may also take up the discussion of § 6.7 and consider a regular ho- 
motopy (as in Def. 6.7.1) F between two Morse functions f 1 , / 2 , and the 
induced map 

0 21 :a(/\z)^a(/ 2 ,z). 

In order to verify the relationship 

</> 21 o Of 1 = df 2 o (j) 21 (6.9.7) 

with the present choice of signs, we proceed as follows. If p\ is a critical point 
of f 1 , p 2 one of / 2 , with 

Kpi) = M(P2), 

and if s : R. — > X with s(— oo) = p\, s(oo) = p 2 satisfies (6.7.4), i.e. 

s(t) = —grad F(s(t),t), (6.9.8) 

we consider again the linearized Fredholm operator 

L s := V + d 2 F : H l ' 2 (s*TX) -► L 2 (s*TX). 

Since p(p\) = p(p 2 ), Lemma 6.9.2 implies 

ind L s = 0. 

Since by definition of a regular homotopy, L s is surjective, we consequently 
get 

ker L s = 0. 

Thus, Det L s is the trivial line bundle and we may orient it by 1® 1*, 

and we call that orientation again canonical. Thus, we may assign a sign n(s) 
to each trajectory from pi to p 2 solving (6.9.8) as before by comparing the 
coherent and the canonical orientations. Now in order to verify (6.9.7), we 
look at Fig 6.9.2. Here, we have indicated a flow line w.r.t. / 1 from pi to 
another critical point rq of / 1 with p(pi) — p(r i) = 1, and likewise one w.r.t. 
f 2 from p 2 to r 2 with p{p 2 ) — p{r 2 ) = 1, both of them equipped with the 
canonical orientations as defined above for the relative index 1. Since now the 
solution curves of (6.9.8) from p\ to p 2 , and likewise from r\ to r 2 carry the 
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orientation of a trivial line bundle, we may choose the coherent orientations 
so as to coincide with the canonical ones. 



Fig. 6.9.2. 


We now compute for a critical point p\ of / 1 with p(pi) = (3, and with 
the space of solutions of (6.9.8) from p\ to P 2 , 


= 9/ 2 ( E E n ^ P2 I “ ’ 


^m(P2)=/3 sGM£ 1>P2 


E E n ( s i) r i 

\^(n)=/3-l ai& Mil, ri 


= E 

At(r 2 )=/3-l 


( 

E E E n ( s )^( s 2) 

y(P2)=p S2 GM f p 2 2 , r2 


E EE n (si)«( s ') r 2- 

M(ri)=/3-l SieJM /; ri s'£Al^ r2 J 


Again, as in Thm. 6.5.1, trajectories occur in pairs, but the pairs may be of 
two different types: within each triple sum, we may have a pair (s^, s^) 
and (s*- 2 -*, s^ 2) ), and the two members will carry opposite signs as we are then 
in the situation of Fig. 6.9.1. The other type of pair is of the form (s, s 2 ) and 
(si, s ' ), i.e. one member each from the two triple sums. Here, the two members 
carry the same sign, according to the analysis accompanying Fig. 6.9.2, but 
since there are opposite signs in front of the two triple sums, we again get a 
cancellation. 

In conclusion, all contributions in the preceding expression cancel in pairs, 
and we obtain 

d f 2 o q b 21 - (j) 21 o df 1 = 0, 
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as desired. We thus obtain 


Theorem 6.9.3 Let X be a compact, finite dimensional, orientable Rieman- 
nian manifold. Let J 1 ,/ 2 be Morse-Smale-Floer functions, and let F be a 
regular homotopy between them. Then F induces a map 

4> 21 : C*(f l ih) — > C*(/ 2 ,Z) 


satisfying 


d o (ft 21 = f> 21 o d, 


and hence an isomorphism of the corresponding homology groups defined by 
f 1 and f 2 , resp. □ 


Corollary 6.9.1 Under the assumptions of Thm. 6.9.3, the numbers bk(X, f) 
defined at the end of % 6.6 do not depend on the choice of a Morse-Smale- 
Floer function f and thus define invariants bk(X) of X . □ 

Definition 6.9.4 The numbers bp-(X) are called the Betti numbers of X 


Remark. The Betti numbers have been defined through the choice of a 
Riemannian metric. In fact, however, they turn out not to depend on that 
choice. See the Perspectives for some further discussion. 


Perspectives. The relative approach to Morse theory presented in this chapter 
was first introduced by Floer in [74]. It was developed in detail by Schwarz[221], 
and starting with § 6.4 we have followed here essentially the approach of Schwarz 
although in certain places some details are different (in particular, we make a more 
systematic use of the constructions of § 6.3), and we cannot penetrate here into all 
the aspects worked out in that monograph. An approach to Floer homology from 
the theory of hyperbolic dynamical system has been developed in [250]. We also 
refer the reader to the bibliography of [221] for an account of earlier contributions 
by Thom, Milnor, Smale, and Witten. (Some references can also be found in the 
Perspectives on § 6.10.) 

In particular, Witten[256], inspired by constructions from supersymmetry, es- 
tablished an isomorphism between the cohomology groups derived from a Morse 
function and the ones coming from the Hodge theory of harmonic forms as devel- 
oped in Chapter 2 of the present work. 

In some places, we have attempted to exhibit geometric ideas even if consider- 
ations of space did not allow the presentation of all necessary details. This applies 
for example to the § 6.8 on graph flows which is based on [22]. As in Schwarz’ 
monograph, the construction of coherent orientations in § 6.9 is partly adapted 
from Floer, Hofer[75]. This in turn is based on the original work of Quillen[204] on 
determinants. 

The theory as presented here is somewhat incomplete because we did not de- 
velop certain important aspects, among which we particularly wish to mention the 
following three: 
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1) Questions of genericity: 

A subset of a Baire topological space is called generic if it contains a count- 
able intersection of open and dense sets. In the present context, one equips 
the space of (sufficiently smooth) functions on a differentiable manifold X 
as well as the space of Riemannian metrics on X with some C k topology, 
for sufficiently large k. Then at least if X is finite dimensional and com- 
pact, the set of all functions satisfying the Morse condition as well as the 
set of all Riemannian metrics for which a given Morse function satisfies the 
Morse-Smale-Floer condition are generic. 

2) We have shown (see §§ 6.7, 6.9) that a regular homotopy between two 
Morse functions induces an isomorphism between the corresponding ho- 
mology theory. It remains to verify that this isomorphism does not depend 
on the choice of homotopy and is flow canonical. 

3) Independence of the choice of Riemannian metric on A' : We recall that by 
Lemma 1.5.1, a Riemannian metric on A' is given by a symmetric, positive 
definite covariant 2-tensor. Therefore, for any two such metrics go,gi and 
0 < t < l,gt := tgo + (1 — t)g\ is a metric as well, and so the space of all 
Riemannian metrics on a given differentiable manifold is a convex space, in 
particular connected. If we now have a Morse function /, then the gradient 
flows w.r.t. two metrics go,gi can be connected by a homotopy of metrics. 
The above linear interpolation gt may encounter the problem that for some 
t, the Morse-Smale-Floer transversality condition may not hold, and so one 
needs to consider more general homotopies. Again, for a generic homotopy, 
all required transversality conditions are satisfied, and one then conclude 
that the homology groups do not depend on the choice of Riemannian met- 
ric. Thus, they define invariants of the underlying differentiable manifold. 
In fact, they are even invariants of the topological structure of the manifold, 
because they satisfy the abstract Eilenberg-Steenrood axioms of homology 
theory, and therefore yield the same groups as the singular homology theory 
that is defined in purely topological terms. 

These points are treated in detail in [221] to which we consequently refer. 

As explained in this chapter, we can also use a Morse function to develop a 
cohomology theory. The question then arises how this cohomology theory is related 
to the de Rham-Hodge cohomology theory developed in Chapter 2. One difference 
is that the theory in Chapter 2 is constructed with coefficients R, whereas the 
theory in this Chapter uses Z 2 and Z as coefficients. One may, however, extend 
those coefficients to R as well. Then, in fact, the two theories become isomorphic 
on a compact differentiable manifolds, as are all cohomology theories satisfying the 
Eilenberg-Stennrod axioms. These axioms are verified for Morse-Floer cohomology 
in [221]. The background in algebraic topology can be found in [229]. Witten[256] 
derived that isomorphism in a direct manner. For that purpose, Witten considered 
the operators 

dt := de tf , 

their formal adjoints 

df = e tf d*e~ t ^ 

and the corresponding Laplacian 

■At '■= dtd t + d t dt- 

For t = 0, Aq is the usual Laplacian that was used in chapter 2 in order to develop 
Hodge theory and de Rham cohomology, whereas for t — > 00 , one has the following 
expansion 
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/it 


dd + d d + t \\df\\ 


d 2 h 

dx k dxi 



where (gf j)j_ 1 „ is an orthonormal frame at the point under consideration. This 
becomes very large for t — > oo, except at the critical points of /, i.e. where df — 0. 
Therefore, the eigenfunctions of At will concentrate near the critical points of / for 
t — > oo, and we obtain an interpolation between de Rham cohomology and Morse 
cohomology. 

An elementary discussion of Morse theory, together with applications to closed 
geodesics, can be found in [182]. 

Finally, as already mentioned, Conley developed a very general critical point 
theory that encompasses Morse theory but applies to arbitrary smooth functions 
without the requirement of nondegenerate critical points. This theory has found 
many important applications, but here we have to limit ourselves to quoting the 
references Conley[50], Conley and Zehnder[51]. In another direction, different ap- 
proaches to Morse theory on singular (stratified) spaces have been developed by 
Goresky and MacPherson[90] and Ludwig[172]. 


6.10 The Morse Inequalities 

The Morse inequalities express relationships between the Morse numbers /i,;, 
defined as the numbers of critical points of a Morse function / of index i, and 
the Betti numbers 6; of the underlying manifold X. In order to simplify our 
exposition, in this §, we assume that X is a compact Riemannian manifold, 
and we only consider homology with Z 2 -coefficients (the reader is invited 
to extend the considerations to a more general setting). As before, we also 
assume that / : X — » R. is of class C 3 and that all critical points of / are 
nondegenerate, and that (A, /) satisfies the Morse-Smale-Floer condition. 

As a preparation, we need to consider relative homology groups. Let A 
be a compact subset of X, with the property that flow lines can enter, but 
not leave A. This means that if 

x(t ) = — grad f(x(t)) for t G K 

and 

x(to) € A for some tjeRU {— oo}, 

then also 

x(t) € A for all t > t 0 . 

We obtain a new boundary operator d A in place of d by taking only those 
critical points of / into account that lie in X\A. Thus, for a critical point 
p € A\A, we put 

d A p:= ^2 (#z 2 M S p r) r - 

rec.(/)nx\4 

fj,(p,r) = l 


( 6 . 10 . 1 ) 
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By the above condition that flow lines cannot leave A once they hit it, all 
flow lines between critical points p,r £ X\A are entirely contained in X\A 
as well. In particular, as in Thm 6.5.2, we have 

d A ■ d A p = 0 for all critical points of / in X\A. (6.10.2) 

Defining C A {f, Z 2 ) as the free Abelian group with Z 2 -coefficients generated 
by the critical points of / in X\A, we conclude that 

(C A (f, Z 2 ),d A ) 

is a chain complex. We then obtain associated homology groups 

H k (X, A, /, Z 2 ) := 7 (6.10.3) 

image d£ +1 

as in § 6.5. 

We shall actually need a further generalization: Let A CY C X be compact, 
and let / : X — > R. satisfy: 

(i) If the flow line x(t), i.e. 

x(t) = —grad f(x(t)) for all t, 

satisfies 

x(to) £ A for some to€iU {— oo}, 

then there is no t > to with x(t) £ F\A. 

(ii) If the flow line x(t) satisfies 

O 

x(ti) £ Y, x(t 2 ) £ X\Y, with — oo < t\ < t 2 < oo, 
then there exists t\ < t 0 < t 2 with 

x(t 0 ) £ A. 

Thus, by (i), flow lines cannot reenter the rest of Y from A , whereas by (ii), 
they can leave the interior of Y only through A. If p £ Y\A is a critical point 
of /, we put 

d Y ’ A p~ ^2 (#z 2 -Ad£ r ) r. (6.10.4) 

reC*(f)nY\A 
^(p,r) = l 

Again, if p and r are critical points in Y\A, then any flow line between them 
also has to stay entirely in Y\A, and so as before 

d YA o d YA = 0, (6.10.5) 

and we may define the homology groups 

ker 8 Y ' A 

H k (Y, A, /, Z 2 ) := ^ (6.10.6) 

image o fc J hl 
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now apply these constructions in three steps: 

Let p be a critical point of / with Morse index 

p{p) = k. 

We consider the unstable manifold 


W u {p) = {x(-) flow line with x(— oo) = p}. (6.10.7) 

As the parametrization of a flow line is only defined up to an additive 
constant, we use the following simple device to normalize that con- 
stant. It is easy to see, for example by Thm. 6.3.1, that for sufficiently 
small e > 0 ,W u (jp) intersects the sphere dB(p,e) transversally, and 
each flow line in W u (p) intersects that sphere exactly once. We then 
choose the parametrization of the flow lines x(-) in W u (p) such that 
x(0) always is that intersection point with the sphere dB(p,e). Hav- 
ing thus fixed the parametrization, for any T £ R, we cut all the flow 
lines off at time T: 


Yp \= {x(t) : —oo < t < T,x(-) flow line in W u (p)} (6.10.8) 


and 

Ap := {x(T) : x(-) flow line in W u (p)}. (6.10.9) 

It is easy to compute the homology H^(Y^ , Aj, /, Z 2 ) : p is the only 
critical point of / in Y y \A y , and so 

d Y p’ A p p= o. (6.10.10) 


y 

Thus, the kernel of d k p ’ p is generated by p. All the other kernels and 


-A» > A - 


images of the d p p are trivial and therefore 


Hj (Xp , Ap , /, Z 2 ) = 


Z 2 if j = k 
0 otherwise , 


(6.10.11) 


for all T eR. 

Thus, the groups Hj(Y^,Ap,f, Z 2 ) encode the local information ex- 
pressed by the critical points and their indices. No relations between 
different critical points are present at this stage. Thus, for this step, 
we do not yet need the Morse-Smale-Floer condition. 

We now wish to let T tend to 00 , i.e. to consider the entire unstable 
manifold W u (p). W u (p), however, is not compact, and so we need to 
compactify it. This can be done on the basis of the results of §§ 6.4, 
6.5. Clearly, we need to include all critical points r of f that are end 
points of flow lines in W u {p), i.e. 


r = x(oo) for some flow line x(-) in W u (p). 
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In other words, we consider all critical points r to which p is connected 
by the flow in the sense of Def. 6.5.2. In particular, for any such r 

p{r) < p(p), 

because of the Morse-Smale-Floer condition, see (6.5.2). Adding those 
critical points, however, is not yet enough for compactifying W u (p). 
Namely, we also need to add the unstable manifolds W u (r) of all those 
r. If the critical point q is the asymptotic limit y{ oo) of some flow line 
y(-) in W u (r), then, by Lemma 6.5.2, we may also find a flow line x(-) 
in W u (p) with x(oo) = q , and furthermore, as the proof of Lemma 
6.5.2 shows, the flow line y(-) is the limit of flow lines x(-) from W u (p). 
Conversely, by Thm. 6.4.1, any limit of flow lines £„(•) from W“ (p ) , 
n £ N, is a union of flow lines in the unstable manifolds of critical 
points to which p is connected by the flow, using also Lemma 6.5.2 
once more. As these results are of independent interest, we summarize 
them as 

Theorem 6.10.1 Let f £ C 3 (X, R),A a compact Riemann manifold, be a 
function with only nondegenerate critical points, satisfying the Morse-Smale- 
Floer condition. Let p be a critical point of f with unstable manifold W u (p). 
Then W u (p) can be compactified by adding all the unstable manifolds W u (r) 
of critical points r for which there exists some flow line from p to r, and 
conversely, this is the smallest compactification ofW u (p). □ 

We now let Y be that compactification of W u (p), and A := Y\W u (p), 
i.e. the union of the unstable manifolds W u {r) of critical points r to 
which p is connected by the flow. Again, the only critical point of / 
in Y\A is p, and so we have as in 1) 

Hj(Y,A,f, Z 2 ) = { J 2 if l = (6.10.12) 

1 0 otherwise. 

The present construction, however, also allows a new geometric in- 
terpretation of the boundary operator d. For that purpose, we let 
C'(/, Z 2 ) be the free Abelian group with Z 2 -cocflicients generated by 
the set C((/) of unstable manifolds W u (p) of critical points p of /, 
and _ 

d'W u (p):= Y, (#z*M f P ,r)W u (r). (6.10.13) 

rec,(f) 
f*M=/Up) — i 

Thus, if y(p) = k, the boundary of the fc-dimensional manifold W u (p) 
is a union of (k — l)-dimensional manifolds W u {r). Clearly, & o d' = 0 
by Thm. 6.5.2, as we have simply replaced all critical points by their 
unstable manifolds. This brings us into the realm of classical or stan- 
dard homology theories on differentiable manifolds. From that point 
of view, the idea of Floer then was to encode all information about 
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certain submanifolds of X that generate the homology, namely the 
unstable manifolds W u (p) in the critical points p themselves and the 
flow lines between them. The advantage is that this allows a formula- 
tion of homology in purely relative terms, and thus greater generality 
and enhanced conceptual clarity, as already explained in this chapter. 

3 ) We now generalize the preceding construction by taking unions of 
unstable manifolds. For a critical point p of /, we now denote the 
above compactification of W u {p) by Y (p). We consider a space Y that 
is the union of some such Y (p ) , and a subspace A that is the union 
of some Y(q) for critical points q £ Y . As before, we get induced 
homology groups H k (A), H k (Y), H k (Y, A), omitting / and Z 2 from 
the notation from now on for simplicity. As explained in 2), we may 
consider the elements of these groups as equivalence classes (up to 
boundaries) either of collections of critical points of / or of their 
unstable manifolds. 


We now need to derive some standard facts in homology theory in our 
setting. A reader who knows the basics of homology theory may skip the 
following until the end of the proof of Lemma 6 . 10 . 4 . 

We recall the notation from algebraic topology that a sequence of linear 
maps fj between vector spaces A 3 


... A. 


i + 1 


A, 


A *— i 


is called exact if always 


ker(/i) = image (f i+ 1). 


We consider the maps 


i k : H k {A) - H k (Y) 
j k : H k (Y) - H k (Y,A) 
d k : H k (Y,A) —> H k _i(A) 


defined as follows: 

If 7T € C k ( A) , the free Abelian group with Z2-coefficients generated by the 
critical points of / in A , we can consider n also as an element of C k (Y), from 
the inclusion A c — > Y. If ir is a boundary in C k (A ), i.e. i r = d k+ i"f for some 
7 € C k+ \{A), then by the same token, 7 can be considered as an element of 
C k+ i(Y), and so 7r is a boundary in C k (Y ) as well. 

Therefore, this procedure defines a map i k from H k (A) to H k (Y). 

Next, if 7r £ C k (Y), we can also consider it as an element of C k (Y 1 A ), by 
forgetting about the part supported on A , and again this defines a map j k in 
homology. 

Finally, if 7r £ C k (Y) with dn £ C k -\(A) and thus represents an element 
of H k (Y,A), then we may consider dir as an element of H k _i(A), because 
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d o dir = 0. dir is not necessarily trivial in H k _i(A), because 7r need not be 
supported on A , but dir as an element of H k _i(A) does not change if we 
replace ir by ir + 7 for some 7 € C k ( A) . Thus, dir as an element of H k _i(A) 
depends on the homology class of ir in H k (Y,A ), and so we obtain the map 
dk-.H^Ai^Hk-^A). 

The proof of the following result is a standard routine in algebraic topol- 
ogy: 

Lemma 6.10.1 The sequence 

... H k {A) ^ H k (Y) ^4 H k (Y,A) -^4 H k ^{A) — ... 


is exact. 

Proof. We denote the homology classes of an element 7 by [7]. 

1) Exactness at H k (A) : 

Suppose [7] € ker i k , i.e. 

ik\l\ = 0. 

This means that there exists ir £ C k +i(Y) with 

dir = i fc ( 7) 


Since i k { 7) is supported on A, ir represents an element of H k+ i(Y, A), and 
so [7] £ image (d k+ i). Conversely, for any such ir, dir represents the trivial 
element in H k (Y), and so i k [dir] = 0, hence [dir\ £ ker i k . Thus i k °d k+ i = 0. 

2) Exactness at H k {Y): 

Suppose [7r] £ ker j k . This means that ir is supported on A , and so [7r] is in 
the image of i k . Conversely, obviously j k o i k = 0. 

3) Exactness at H k (Y,A ) : 

Let [vr ] £ kerd k - Then c>7r = 0, and so ir represents an element in 
H k (Y). Conversely, for any [71-] £ H k (Y),dir = 0, and therefore d k o 
jk = 0- □ 

In the terminology of algebraic topology, a diagram 

A 2 A\ 

f I I 9 

B 2 A B 1 

of linear maps between vector spaces is called commutative if 

g o a = b o /. 

Let now (Yi,Y 2 ) and (Y 2 , Y3) be pairs of the type (Y, A) just considered. We 
then have the following simple result 
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Lemma 6.10.2 The diagram 


H k {Y 2 , y 3 ) fr fc _!(y 2 ) H k _ i(y 2 , y 3 ) 


l -1,2,3 


l -2,3 

i 


I -1,2 

I ** 


l -1,2,3 

i l k- 1 

H k (Fi , y 2 ) ^ H k _ 1 (y 2 ) X fffc-1 (yi) ^ (y , y 2 ) 


where the vertical arrows come from the inclusions Y 3 c — > F 2 c ~* Li . and where 
superscripts indicate the spaces involved, is commutative. 


Proof. Easy; for example, when we compute i k °d k ’ 3 [ 7r], we have an element 
7T of C k (Y 2 ), whose boundary dn is supported on Y 3 , and we consider that 
as an element of C k -i(Y2). If we apply i k ’ ’ 3 to [7r], we consider tt as an 
element of C k (Y\) with boundary supported on C k -i(Y 2 ), and d\.' 2 [i r] is that 
boundary. Thus i 2 k 3 o d l ’ 3 = d k 2 o i k 2,3 . □ 


Lemma 6.10.3 Let Ys C Y 2 C Y\ be as above. Then the sequence 
... — » H k+ 1 {Y X ,Y 2 ) 3 k ’+l ^?' +1 H k (Y 2 ,Y 3 ) ¥+ H k (Yi, Ys) ^ H k (Y 1 : Y 2 ) 


is exact. 

(Here, the map i k comes from the inclusion Y 2 Y k , whereas j k ’ 
arises from considering an element of C k -i(Yi,Y 3 ) also as an element of 
C k -i(Yi,Y 2 ) (since Y 3 C Y 2 ), in the same way as above). 


Proof. Again a simple routine: 

T) Exactness at H k {Y 2 , Y 3 ): 

i k [if\ = 0 77 G C k+ 1 (Y U Y 3 ) : cby = 7 r, and in fact, we may consider 7 as 
an element of C k+ \(fY\,Y 2 ) as the class of 7r in H k (Y 2 ,Y 3 ) is not influenced 
by adding dui for some w G C k+ \{Y 2 ). Thus 7r is in the image of j k+[ o d k ' +1 . 

2 ) Exactness at H k (Y\, I3): 

j k [tt\ = 0 77 G C*+i(yi, y 2 ) : c?7 = 7r, and so tt is trivial in homology up 
to an element of C k (Y 2 , ¥3), and so it is in the image of i k . 

3 ) Exactness at H k (Y\, Y 2 ): 

j k ’ 3 0 ®k[ n ] = 0 ^ d k TT vanishes up to an element of C k -\(Y 3 ) 

77 7r is in the image of j k . 


□ 


Finally, we need the following algebraic result: 

Lemma 6.10.4 Let 

... — > A 3 A 2 Ai 0 

be an exact sequence of linear maps between vector spaces. 
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Then for all k £ N 

dim A\ — dim A 2 + dim A 3 — ... — (— l) fc dim Ak + (— l) fc dim(kerafc) = 0. 

(6.10.14) 

Proof. For any linear map £ = V — > W between vector spaces, 
dimM = dim(kerL) + dim(image £). 

Since by exactness 

dim(image aj) = dim(kera,_i) 

we obtain 

dim(Aj) = dim(keraj) + dim(ker aj-i). 

Since dim Ay = dimkerai, we obtain 

dim Ai — dim A 2 + dim A 3 — ... + (— l) fe dim(kerafc) = 0. 

□ 

We now apply Lemma 6.10.4 to the exact sequence of Lemma 6.10.3. With 
b k (X,Y) := dim (H k (X,Y)) 

Vk(Yi,Y 2 ,Y 3 ) = dim(kerj ^ 1 od*’ 2 ), 

we obtain 

k 

53(-i) i (6 i (yi,y 2 ) - 6i(yi,y 3 ) + w.yo) 

i= 0 

- (-i) fc i/ fc (Yi,y 2 ,y,) = 0 . 

Hence 

(-1 ) fc - 1 ^_ 1 (F 1 ,F 2 ,F 3 ) =(-V k Vk(Yi,Y 2 ,Y 3 ) - (-1 ) k b k (Y l7 Y 2 ) 

+ (-i) fe & fc (Yi,yj) - (-i) fc 6 fe (y 2 ,F 3 ) 

We define the following polynomials in t : 

P(t,X,Y) :=Y,bk(X,Y)t k 

k> 0 

Q(t, Y u Y 2 , Y 3 ) := £ ^(Y u Y 2 , Y 3 )t k 

k> 0 

Multiplying the preceding equation by (— 1 ) k t k and summing over k, we ob- 
tain 


Q(t, Y u Y 2 , Y 3 ) = Y l ,Y 2 , Y 3 ) + P(t, YuY 2 ) - P(t, YuY 3 ) + P(t , Y 2 , Y 3 ). 

( 6 . 10 . 16 ) 
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We now order the critical points pi, of the function / in such a manner 

that 

p{pi) > pipj) whenever i < j. 

For any i, we put 

Yi := Ki(i) := |J Y(p k ) 

k>i 

Y 2 :=Y 2 (i):= |J Y(p fc ) 

k>i+l 

Y 3 := 0. 

Thus Y 2 = Y 1 \W k (p i ). The pair (Y , Y>) may differ from the pair (Y, A) = 
(Y(P. i ),Y(P i )\W k (pi)) in so far as both Y 3 and Y 2 may contain in addition 
the same unstable manifolds of some other critical points. Thus, they are of 
the form (Y U B, A U B) for a certain set B. It is, however, obvious that the 
previous constructions are not influenced by adding a set B to both pairs, 
i.e. we have 

H k {Y UB,AUB) = H k (Y,A), for all k, 
because all contributions in B cancel. Therefore, we have 

HkiY^i), y 2 (i)) = H k (Y(pi), Y( Pi )\W k ( Pi )) = { J 2 f ° r k = ^ 

1 0 otherwise. 

(6.10.17) 

Consequently, 

P{t,Y lt Y 2 ) =t^ Pi \ (6.10.18) 

We now let pi be the number of critical points of / of Morse index i. Since 
the dimension of any unstable manifold is bounded by the dimension of X, 
we have pe = 0 for £ > dimX. (6.10.18) implies 

dimA' 

H (6.10.19) 

i= 0 t 

From (6.10.16), we obtain for our present choice of the triple (Yi, Y^,^) 

Pit, Y\ ( j) ,Y 2 (t))=P(t, Yi (i),0) - P{t, Y 2 (») , 0) 

+(1 + t)iX(t, Yi(i), Y 2 (f), 0), 

and summing w.r.t. i and using Yi(l) = X, we obtain 

dim X 

J2 Pit , Yi(0, Y 2 (z)) = Pit, X , 0) + (1 + t) Q(t) (6.10.20) 

i=0 

for a polynomial Q{t) with nonnegative coefficients. Inserting (6.10.19) in 
(6.10.20) and using the relation 
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P(t, X,4>)=Y^ t J dim Hj (X) ( since h o (X, 0) = Hj (X)) 
= ^2t j bj( x ) ( see Cor. 6-9.1) 


we conclude 


Theorem 6.10.2 Let f be a Morse-Smale-Floer function on the compact, 
finite dimensional orientable Riemannian manifold X. Let m be the number 
of critical points of f of Morse index l, and let b k (X) be the k-th Betti number 
of X. Then 

dim X 

Y, t j bj(X) + (1 + t) Q(t) (6.10.21) 

e=o 1 

for some polynomial Q(t ) in t with nonnegative integer coefficients. 

We can now deduce the Morse inequalities 


Corollary 6.10.1 Let f be a Morse-Smale-Floer function on the compact, 
finite dimensional, orientable Riemannian manifold X. Then, with the nota- 
tions of Thm. 6.10.2 

(i) p, k > b k (X) for all k 

(ii) n k - Hk - 1 + /z fc _ 2 - ... ± po > b k (X) - b k _i(X)... ± b 0 (X) 

(Hi) JX(— l) J fij — ')2j(—l) : 'bj(X) (this expression is called the Euler 

characteristic of X). 


Proof. 

(i) The coefficients of t k on both sides of (6.10.21) have to coincide, and 
Q(t) has nonnegative coefficients. 

(ii) Let Q(t) = From (6.10.21), we get the relation 

k k k— 1 

y t j p,j = ^2 t j bj(x) + (i + 1) ^2 t'hi + t k q k 

1=0 1=0 1=0 


for the summands of order < k. We put t = — 1. Since q k > 0, we 
obtain 

£(-irVi > £(-ir%. 

1=0 1=0 


(iii) We put t = — 1 in (6.10.21). 


□ 


Let us briefly return to the example discussed in § 6.1 in the light of the 
present constructions. We obtain interesting aspects only for the function fi 
of § 6.1. The essential feature behind the Morse inequality (1) is that for a 
triple (Li, 12 , 13 ) satisfying I3 C I 2 C Yi as in our above constructions, we 
always have 
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b k {Y u Y 3 ) < b k (Y 1 ,Y 2 ) + b k (Y 2 ,Y 3 ). (6.10.22) 


In other words, by inserting the intermediate space Y 2 between Y) and Y 3 , 
we may increase certain topological quantities, by inhibiting cancellations 
caused by the boundary operator d. If, in our example from § 6.1, we take 
Y\ = X, Y 3 = 0, we may take any intermediate Y 2 . If we take Y 2 = Y(p 2 ) 
(p 2 being one of two maximum points), then Yi\Y 2 = W k (pi) (p± the other 
maximum), and so 


and 


b k {Y 1: Y 2 ) 


b k (Y 2 ,Y 3 ) 


( 1 for k = 2 
1 0 otherwise 

f 1 for k = 0 
1 0 otherwise 


(we have dp 2 = p 3 , dp 3 = 2 p 4 = 0 in Y 2 ), 
since bk(X) = | ^ fort = 1 ' , 

we have equality in (6.10.22). If we take 
however, we get 


and so, 


Y 2 = Y(p 3 ) 

for k = 2 
otherwise 


(p 3 the saddle point), 


(since dpi = 0 = dp 2 in (Yi, Y 2 )) and 


6 fe (Y 2 ,Y 3 ) = {J 


for k = 1 
otherwise 


(since dp 3 = 0, but there are no critical points of index 2 in Y 2 ). Thus, in 
the first case, the boundary operator d still achieved a cancellation between 
the second maximum and the saddle point while in the second case, this was 
prevented by placing p 2 and p 3 into different sets. Generalizing this insight, 
we conclude that the Morse numbers pt arise from placing all critical points 
in different sets and thus gathering only strictly local information while the 
Betti numbers bn incorporate all the cancellations induced by the boundary 
operator d. Thus, the pt and the be only coincide if no cancellations at all 
take place, as in the example of the function fi in § 6.1. 


Perspectives. In this §, we have interpreted the insights of Morse theory, as 
developed by Thom[242], Smale[228], Milnor[183], Franks[76] 199-215, in the light 
of Floer’s approach. Schwarz[222] used these constructions to construct an explicit 
isomorphism between Morse homology and singular homology. 
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6.11 The Palais-Smale Condition and the Existence of 
Closed Geodesics 


Let M be a compact Riemannian manifold of dimension n , with metric 
(•, •) and associated norm || • || = (■, -} 5 . We wish to define the Sobolev space 
do = H 1 (S 1 ,M) of closed curves on M with finite energy, parametrized on 
the unit circle S 1 . We first consider H 1 (/, K") := R n ), where I is 

some compact interval [a, b], as the closure of C°°(I, R") w.r.t. the Sobolev 
H l ' -norm. This norm is induced by the scalar product 

b b 

(ci, c 2 ) := J Ci(t) • C 2 (t)dt + J dc ^ ■ dC 2 ( i^ dt, (6.11.1) 

a a 

where the dot • denotes the Euclidean scalar product on R". U 1 (/,]R") then 
is a Hilbert space. 

Since / is 1 -dimensional, by Sobolev’s embedding theorem (Theorem 
A. 1 . 7 ), all elements in H 1 (I,M n ) are continuous curves. Therefore, we can 
now define the Sobolev space H 1 (S 1 , M) of Sobolev curves in M via locali- 
zation with the help of local coordinates: 

Definition 6 . 11.1 The Sobolev space do = H 1 (S 1 ,M) is the space of all 
those curves c : S 1 —> M for which for every chart x : U — > (U open in 

M), (the restriction to any compact interval of) 

xo c: c~ l {U) — > R" 

is contained in the Sobolev space R n ). 

Remark. The space d 0 can be given the structure of an infinite dimensional 
Riemannian manifold, with charts modeled on the Hilbert space H 1 ’ 2 ( /, R ra ). 
Tangent vectors at c € do then are given by curves 7 € H 1 (S 1 ,TM), i.e. 
Sobolev curves in the tangent bundle of M, with 7 (t) € T C ^M for all t € S 1 . 
For 7172 € T c Aq, i.e. tangent vectors at c, their product is defined as 

(71,72):= J {D'y 1 (t),D'y 2 {t))dt, 

tes 1 

where D^i{€) is the weak first derivative of 7* at t, as defined in A.l. This 
then defines the Riemannian metric of do- While this becomes conceptually 
very satisfactory, one needs to verify a couple of technical points to make this 
completely rigorous. For that reason, we rather continue to work with ad hoc 
constructions in local coordinates. In any case, d 0 assumes the role of the 
space X in the general context described in the preceding §§. 
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The Sobolev space A 0 is the natural space on which to define the energy 
functional 

E ^) = \ J \\Dc(m 2 dt 
s 1 

for curves c : S 1 — > M, with Dc denoting the weak first derivative of c. 

Definition 6.11.2 ('u Tl ) rteN C A 0 converges to u £ A 0 in H 1 ’ 2 iff 

1) u n converges uniformly to u ( u n =4 it). 

2) E{u n ) — » E{u) as n —> oo. 

Uniform convergence u n u implies that there exist coordinate charts 

m 

R" (/i = 1 ,m) and a covering of S' 1 = U by open sets 

11=1 

such that for sufficiently large n 

Un{V^),u(V^) C Ufj, for n = 1, . . . , TO. 

If now p £ Co°(V^,lR ra ) for some g, then for sufficiently small |e| 

+ £¥>(*)) C fn(Un) for all t £ V^, 

i.e. we can perform local variations without leaving the coordinate chart. In 
this sense we write 

u + Eif 

instead of / p o it + ep. For such p then 
d \ d C 

feE{u + £<p)\e=0 = 2de 9ij ( U + + £ ^)(^ + £ <P j ) dt |e=0> 

where everything is written w.r.t. the local coordinate — > K" (“•” of 

course denotes a derivative w.r.t. t £ S 1 .) 

= J (9ij{u)u l p J + ^gij t k{u)u l u J p k )dt (using g i5 = g oi ) (6.11.1) 
If u £ if 2 ’ 2 (S' 1 , M), this is 

= - J ( gij(u)u l ip 3 + g iitl u l u l pp - ^ 9 ij,kU l u 3 p k )dt (6.11.2) 

= — J (it 1 + rit{u)u k u e )gij(u)p : i dt as in 1.4. 

We observe that <p £ H 1,2 is bounded by Sobolev’s embedding theorem (The- 
orem A. 1.7) (see also the argument leading to (6.11.5) below) so that also 
the second terms in (6.11.1) and (6.11.2) are integrable. 

We may put 
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\\DE{u)\\ = sup { ^E(u + £<p)| e= o : (6.11.3) 

ip £ Hq’ 2 (V lll W n ) for some /z, 

J 9ij( u ) ft&'dt < 1 } ■ 


For second derivatives of E, we may either quote the formula of Theorem 
4.1.1 or compute directly in local coordinates 

d 2 

-^E{u + e<p) |e=0 

Id 2 /' .... 

= 2 dE 2 gij(u + Etp)(u l + E(p l )(u J + £ip 3 )dt. 

= J (9ij(u) l P ll P J + 2g i ^ k iL l ip J ip k + gij^eid u 3 p k p e )dt 


which is also bounded for u and ip of Sobolev class H 1,2 . 
Suppose now that u £ A 0 satisfies 

DE(u) = 0. 


This means 

0 = J ( gij(u)u l ip 3 + ^gij,k(u)u l u 3 p k )dt for all ip £ H 12 . (6.11.4) 

Lemma 6.11.1 Any u £ Aq with DE(u) = 0 is a closed geodesic (of class 
C°°). 

Proof. We have to show that u is smooth. Then (6.11.2) is valid, and Theorem 
A. 1.5 gives 

u l + r^(u)u k u e = 0 for * = 1, . . . , dim M, 
thus u is geodesic. 

We note that u is continuous so that we can localize in the image. More 
precisely, we can always find sufficiently small subsets of S' 1 whose image 
is contained in one coordinate chart. Therefore, we may always write our 
formulae in local coordinates. We first want to show 

u £ H 2,1 . 

For this, we have to find v £ L 1 with 

J u l f)i = J v l r\i 

where we always assume that the support of ry £ C(f D (S 1 , M) is contained in 
a small enough subset of S' 1 so that we may write things in local coordinates 
as explained before. 
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We put 
Then 


:= g ZJ (u(t))rji(t) . 


u%dt = - j u%dt which is valid since u G H 1 ’ 2 
(: gij{u{t))u l (jP + g i j, k u k u L y°)dt 


(2 - g l j,k(u)u k u l (fi J )dt by (6.11.4) 

(^9tj,kg ke u l u J - gij,kU k u l g 3( -)rudt 

(^9 ze (djk/ - 9jt,k ~ gke,j)u 3 u k r]idt, renaming indices 


= — / r l jk u j u k T]idt. 


With i>* = -r i jk u j ii k G L 1 , the desired formula 


(6.11.5) 


U Tji — 


J v l r\i for 77 G Cq 3 (S 1 ,M) with sufficiently small support 


then holds, and 


uG H z 


By the Sobolev embedding theorem (Theorem A. 1.7) we conclude 

u € H 1,q for all q < 00. 


(We note that since S' 1 has no boundary, the embedding theorem holds for 
the H k ' p spaces and not just for Hq P . For the norm estimates, however, one 
needs \\f\\H k ’P(Q) on the right hand sides in Theorem A. 1.7 and Corollary 
A. 1.7, instead of just \\D k f\\LP.) 

In particular, u G ff 1,4 (l 7), hence 

ri k {u)u 3 u k g /.-. 

(6.11.5) then implies 

u G H 2 ' 2 

hence ii G C° by Theorem A. 1.2 again. 

Now 

j t {r jk {u)Vu k ) = 2/ + rj k>t iSv?u k (using r\ k = r kj ) 

G L 2 , 


since u G L 2 , u G L°°. Thus 
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riMvPif e //'- 2 . 

Then 

u G H 3 ’ 2 , 

by (6.11.5) again. 

Iterating this argument, we conclude 

u G H k ’ 2 


for all k G N, hence 
by Corollary A. 1.2. 


u G C°° 


□ 


We now verify a version of the Palais-Smale condition: 

Theorem 6.11.1 Any sequence (it„) ra6 N C Aq with 

E(u n ) < const. 

\\DE{u n )\\ — > 0 as n — > 0 

contains a strongly convergent subsequence with a closed geodesic as limit. 

Proof. First, by Holder’s inequality, for every v G Ao,tiA 2 G -S' 1 
*2 *2 

d(v(ti),v(t 2 )) < J (gij{v)v z v 3 )^dt < ((< 2 ~ti) J g l j(v)v z v J dt)? 

t\ ti 

< V2\t-2 — ti\? E(v)z . (6.11.6) 


Thus 


A 0 cCi(S\M), 


i.e. every H l -curve is Holder continuous with exponent and the Holder 
\ -norm is controlled by \j2E(v). 

The Arzela-Ascoli theorem therefore implies that a sequence with E(u n ) < 
const, contains a uniformly convergent subsequence. We call the limit u. u 
also has finite energy, actually 


E(u) < liminf E{u n ). 

n— >oo 

We could just quote Theorem 8.4.2 below. Alternatively, by uniform conver- 
gence everything can be localized in coordinate charts, and lower semiconti- 
nuity may then be verified directly. For our purposes it actually suffices at this 
point that u has finite energy, and this follows because the U 1,2 -norm (defined 
w.r.t. local coordinates) is lower semicontinuous under T 2 -convergence. 
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We now let be a partition of unity subordinate to (V)X=i,...,m> 

our covering of S 1 as above. 

Then 

/ m 

Y T h*(9ij( u n)K.K ~ 9ij{u)u l u J )dt (6.11.7) 

where the superscript /r now refers to the coordinate chart — > K™. 

In the sequel, we shall omit this superscript, however. 

By assumption (cf. (6.11.1)) 

J {g%j{u n )u l n <iP + ^gij,k{u n )u\u 3 n ip k )dt -> 0 as n -> oo 

for all ip £ H 1 ’ 2 . 

We use 

P 3 = Vv( U h ~ ul ) 

(where, of course, the difference is computed in local coordinates / /4 ) . 

Then 

J 5'ij,fc(«n)«?7p(Wn - uk ) dt 

< const. - m.axd(u n (t),u(t))E(u n ) 

-► 0 

as n — y oo since E(u n ) < const, and u n =£ u (after selecting a subsequence). 
Consequently from (6.11.1), since \\DE(u n )\\ — > 0, 



= J {( 9ij(u n ) <« - X + (ftjK) - NX* 7 (6.11.9) 

The first term goes to zero by (6.11.8). The second one goes to zero by uniform 
convergence and Holder’s inequality. 
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For the third one, we exploit that (as observed above, after selection of 
a subsequence) u n converges weakly in L 2 to u on V IJ; . This implies that the 
third term goes to zero as well. 

(6.11.9) now implies 


E(u n ) 


E(u ) as n 


oo 


(cf. (6.11.7).) 
u then satisfies 

DE(u) = 0 

and is thus geodesic by Lemma 6.11.1. □ 

As a technical tool, we shall have to consider the negative gradient flow 
of E. 

Remark. In principle, this is covered by the general scheme of § 6.3, but 
since we are working with local coordinates here and not intrinsically, we shall 
present the construction in detail. For those readers who are familiar with 
ODEs in Hilbert manifolds, the essential point is that the Picard-Lindelof 
theorem applies because the second derivative of E is uniformly bounded 
on sets of curves with uniformly bounded energy E. Therefore, the negative 
gradient flow for E exists for all positive times, and by the Palais-Smale 
condition always converges to a critical point of E, i.e. a closed geodesic. 

The gradient of E, VU, is defined by the requirement that for any c £ 
T 0 , Vfi(c) is the H 1 -ve ctor field along c satisfying for all iJ 1 -vector fields 
along c 


(VE(c),V) h i 

= DE(c)(V) = J (c, V)dt. (6.11.10) 

s 1 

Since the space of H l -ve ctor fields along c is a Hilbert space, X7E(c ) exists 
by the Riesz representation theorem. (The space of iL 1 -ve ctor fields along an 
H l - curve can be defined with the help of local coordinates). 

We now want to solve the following differential equation in A 0 : 

J t m = -V£W)) (6.11.11) 

£(0) = c 0 

where Co £ A 0 is given and <P : R + — > A 0 is to be found. 

We first observe 

Lemma 6.11.2 Let be a solution of (6.11.11). Then 
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Proof. By the chain rule, 

j t E{m) = (6.11.12) 

= < 0 . □ 

Theorem 6.11.2 For any cq £ A 0 , there exists a solution : K + — » A 0 of 

j t m = -VEWt)) ( 6 - 11 . 13 ) 

<P(0) = co- 


Proof. Let 

A := {T > 0 : there exists <P : [0,T] — > A 0 solving (6.11.13), with ^(0) = Co}. 

(That <P is a solution on [0, T\ means that there exists some £ > 0 for which 
is a solution on [0, T + £).) 

We are going to show that A is open and nonempty on the one hand 
and closed on the other hand. Then A = R + , and the result will follow. To 
show that A is open and nonempty, we are going to use the theory of ODEs 
in Banach spaces. For c £ A 0l we have the following bijection between a 
neighborhood U of c in Ao and a neighborhood V of 0 in the Hilbert space 
of H 1 -vector fields along c : For £ £ V 

£( r ) l— exp c(T) £(t) (6.11.14) 

(By Theorem 1.4.3 and compactness of c, there exists p 0 > 0 with the prop- 
erty that for all t £ S 1 exp c ( T ) maps the ball B(0,p o ) in T C ^M diffeomor- 
phically onto its image in M.) 

If <P solves (6.11.13) on [s, s + e] we may assume that £ > 0 is so small 
that for all t with s < t < s + £, <P(t) stays in a neighborhood U of c = 
<P(s) with the above property. This follows because since differentiable, in 
particular is continuous in t. Therefore, (6.11.14) transforms our differential 
equation (with its solution <£(f) having values in U for s < t < s + e) into a 
differential equation in V. an open subset of a Hilbert space. Since DE, hence 
VF is continuously differentiable, hence Lipschitz continuous, the standard 
existence result for ODE (theorem of Cauchy or Picard-Lindelof) may be 
applied to show that given any c £ A 0 , there exists £ > 0 and a unique 
solution E : [0, £] — > A 0 of ^S'(t) = —’VE(W(t)) with !F(0) = c. If ^ solves 
(6.11.13) on [0, to], th en putting c = #(£o), we get a solution on [0 , to + £], 
putting #(t) = ’Eft. — to). 
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This shows openness, and also nonemptyness, putting t 0 = 0. To show 
closedness, suppose $ : [0 , t) — > A 0 solves (6.11.13), and 0 < t n < T,t n — + T 
for n — > oo. 

Lemma 6.11.2 implies 


E(T>(t n )) < const. (6.11.15) 

Therefore, the curves <£(f n ) are uniformly Holder continuous (cf. (6.11.6)), 
and hence, by the theorem of Arzela-Ascoli, after selection of a subsequence, 
they converge uniformly to some ct G A 0 ; ct indeed has finite energy because 
we may assume that (<?(t n ))„ G N also converges weakly in H 1,2 to Ct, as in 
the proof of Theorem 6.11.1. By the openness argument, consequently we can 
solve 

j t m = -v£(*(t)) 

<?( f ) = ct 

for T < t < T + e and some e > 0. Thus, we have found T> \ [0, T + e) solving 
(6.11.13), and closedness follows. □ 

We shall now display some applications of the Palais-Smale condition 
for closed geodesics. The next result holds with the same proof for any C 2 - 
functional on a Hilbert space satisfying (PS) with two strict local minima. 

While this result is simply a variant of Prop. 6.2.1 above, we shall present 
the proof once more as it will serve as an introduction to the proof of the 
theorem of Lyusternik and Fet below. 

Theorem 6.11.3 Let ci, C2 be two homotopic closed geodesics on the compact 
Riemannian manifold M which are strict local minima for E ( or, equivalently , 
for the length functional L). Then there exists another closed geodesic C3 
homotopic to ci,C2 with 

E(cA = n := inf max E( A(r)) > max{Fi(ci), E(c 2 )} (6.11.16) 

Aezl t6[0,1] 

with A := A(c± , C2) := {A € C°([0, 1], Ho) : A(0) = Ci,A(l) = C2}, the set of 
all homotopies between C\ and C 2 - 

Proof. We first claim 

3<5 0 > 0V<5 with 0 < S < So 3e > 0 Vc with d\(c, d) = S : 

E(c)>E(a) + £ for* = 1,2. (6.11.17) 

Indeed, otherwise, for i — 1 or 2, 

V<5o 30 < 5 < (5oVn37„ with di(j n ,Ci) = S, 

E(jn) < E(Ci) + - 

n 
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If ||DS(7„)|| — > 0, then (7„) is a Palais-Smale sequence and by Theorem 
6.11.1 converges (after selection of a subsequence) to some 70 with 
rfi(70)Ci) = S,E( 70) = E(ci), contradicting the strict local minimizing prop- 
erty Of Cj. 

If || £>£1(7,1) || > 77 > 0 for all n, then there exists p > 0 with 

\\DE(j)\\ > | whenever di(7 n ,7) < p. (6.11.18) 

This follows, because ||£> 2 £1|| is uniformly bounded on Hi-bounded sets. 

(6.11.18) can then be used to derive a contradiction to the local minimiz- 
ing property of Ci by a gradient flow construction. Such a construction will 
be described in detail below. We may thus assume that (6.11.17) is correct. 
(6.11.17) implies 

k > max(£?(ci), Efo)). (6.11.19) 

We let now K K be the set of all closed geodesics, i.e. curves c in A 0 with 
DE(c) = 0, E(c) = K. homotopic to c\ and C2. 

We have to show 

± 0 . 

We assume on the contrary 

K K = 0 . ( 6 . 11 . 20 ) 

We claim that there exists ?? > 0, a > 0 with 

\\DE{c)\\>a (6.11.21) 

whenever c is homotopic to C \ , C2 and satisfies 

k — T] < E(c) < K + rj. (6.11.22) 

Namely, otherwise, there exists a sequence (7 n )neN of l£ 1 -curves homotopic 
to Ci , C2, with 

lim E( 7n) = k 

n —> 00 

lim DE( 7„) = 0 

n — ^00 

(7n)neN th en is a Palais-Smale sequence and converges to a closed geodesic 
C3 with E(cz) = k, contradicting our assumption K K = 0. 

Thus (6.11.21) has to hold if k — r] < E(c) < k + rj. 

From Theorem 6.11.2, we know that for any t > 0, there is a map 

ylo — * A 0 

c 1—1 A> t {c), where ^t(c) = #(t) solves 

im = -v£($(t)) 

£>(0) = c. 

With the help of this gradient flow, we may now decrease the energy below 
the level k, contradicting (6.11.20). For that purpose, 
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let A € A satisfy 

max E(X(t)) <k + t). (6.11.22) 

TG[0,1] 

Then, as in the proof of Lemma 6.11.2 

A(r))) = -||V£:(^(A(t)))|| 2 < 0. (6.11.23) 

In particular, for t > 0 

max£(<f ( (A(r))) < max£’(A(r)) < n + rj. (6.11.24) 

Since ci and C 2 are closed geodesics, i.e. critical points of E, X7E(d) = 0 (i = 
1,2), hence 

^t(ci) = Cj for all t > 0. 

Therefore 

<P t o A € A for t > 0. 

(6.11.21) , (6.11.23) imply 

-^£'(^i(A(r))) < — a 2 whenever £ l (^(A(r))) > k — rj. (6.11.25) 

(6.11.22) , (6.11.25) imply 


E($„(\(t))) < K-r) 

for s > and all r G [0,1], contradicting the definition of n. Therefore, 
(6.11.20) cannot hold, and the theorem is proved. □ 

As the culmination of this §, we now prove the theorem of Lyusternik 
and Fet 

Theorem 6.11.4 Each compact Riemannian manifold contains a nontrivial 
closed geodesic. 

For the proof, we shall need the following result from algebraic topology 
which, however, we do not prove here. (A proof may be found e.g. in E. 
Spanier, Algebraic topology, McGraw Hill, 1966.) 

Lemma 6.11.3 Let M he a compact manifold of dimension n. Then there 
exist some i,l < i < n, and a continuous map 

h : S l M, 

which is not homotopic to a constant map. 

In case M is a differentiable manifold, then h can also be chosen to be 
differentiable. □ 
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We now prove Theorem 6.11.^: 

We start with a very simple construction that a reader with a little expe- 
rience in topology may skip. 

Let i be as in Lemma 6.11.3. If i = 1, the result is a consequence of 
Theorem 1.4.6. We therefore only consider the case i > 2. h from Lemma 
6.11.3 then induces a continuous map H of the ( i — l)-cell D I_1 into the 
space of differentiable curves in M, mapping d D z ~ x to point curves. In order 
to see this, we first identify D l ~ x with the half equator {x 1 > 0, x 2 = 0} of the 
unit sphere S l in K l+1 with coordinates (a; 1 , . . . , x l+1 ). To p € ZA i_1 C S l , we 
assign that circle c p (t),t € [0,1], parametrized proportionally to arc length 
that starts at p orthogonally to the hyperplane {x 2 = 0} into the half sphere 
{x 2 > 0} with constant values of x 3 , ... , x l+1 . For p G d D l ~ x , c p then is the 
trivial (i.e. constant) circle c p {t ) = p. The map H is then given by 

H{p){t ) :=h o Cp(t). 

Each q G S l then has a representation of the form q = c p (t) with p G D l ~ l . 
p is uniquely determined, and t as well, unless q G dD l ~ l . A homotopy of H , 
i.e. a continuous map 

H : D 1 ^ 1 x [0, 1] — > {closed curves in M} 

that maps dD l ~ l x [0, 1] to point curves and satisfies = H , then 

induces a homotopy h : S l x [0, 1] -»Mof/i by 

h(q,s) = h(cp(t),s) = H(p,s)(t) 

(q = Cp(t), as just described). 

We now come to the core of the proof and consider the space 

A :={A : — > A 0 , A homotopic to H 

as described above, in particular mapping dD 1-1 to 
point curves}, 


and put 


k := inf max E(\(z)). 

XGAzGDi-i 


As in the proof of Thm. 6.11.3, we see that there exists a closed geodesic 7 
with 


7) = «■ 


It only remains to show that k > 0, in order to exclude that 7 is a point curve 
and trivial. Should k = 0 hold, however, then for every £ > 0, we would find 
some A e G A with 

max E(XJz)) < e. 

zeD ‘- 1 
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All curves A e (z) would then have energy less than e. We choose e < 
Then, for every curve c z := A e (z) and each t G [0, 1] 

d(c z (0) , c z (t)) 2 < 2 E(c z ) < pi . 

The shortest connection from c^O) to c z {t) is uniquely determined; denote it 
by Qz,t(s),s € [0, 1]. Because of its uniqueness, q z> t depends continuously on 
2 and t. H(z, s)(t) := q z ,t( 1 — s) then defines a homotopy between A e and a 
map that maps D l ~ l into the space of point curves in M, i.e. into M. 

Such a map, however, is homotopic to a constant map, for example since 
D l ~ 1 is homotopically equivalent to a point. (The more general maps from 
IT” 1 considered here into the space of closed curves on M are not necessarily 
homotopic to constant maps since we have imposed the additional condition 
that dD l ~ x = S' 4-2 is mapped into the space of point curves which is a proper 
subspace of the space of all closed curves.) This implies that A £ is homotopic 
to a constant map, hence so are H and h, contradicting the choice of h. 
Therefore, n cannot be zero. □ 


Perspectives. It has been conjectured that every compact manifold admits 
infinitely many geometrically distinct closed geodesics. “Geometrically distinct” 
means that geodesics which are multiple coverings of another closed geodesic are 
not counted. The loop space, i.e. the space of closed curves on a manifold has a rich 
topology, and Morse theoretic constructions yield infinitely many critical points of 
the energy function. The difficulty, however, is to show that those correspond to 
geometrically distinct geodesics. Besides many advances, most notably by Klingen- 
berg[158], the conjecture is not verified in many cases. Among the hardest cases 
are Riemannian manifolds diffeomorphic to a sphere S n . For n = 2, however, in 
that case, the existence of infinitely many closed geodesics was shown in work of 
Franks[77] and Bangert[13]. For an explicit estimate for the growth of the number 
of closed geodesics of length < £, see Hingston[119] where also the proof of Franks’ 
result is simplified. 

We would also like to mention the beautiful theorem of Lyusternik and Schnirel- 
man that any surface with a Riemannian metric diffeomorphic to S 2 contains at 
least three embedded closed geodesics (the number 3 is optimal as certain ellipsoids 
show). See e.g. Ballmann[10], Grayson[91], Jost[130], as well as Klingenberg[158]. 


Exercises for Chapter 6 

1) Show that if / is a Morse function on the compact manifold X, a < b, 
and if / has no critical point p with a < f(p) < &, then the sublevel 
set {x € X : /( x) < a} is diffeomorphic to {x G X : f(x) < b}. 

2) Compute the Euler characteristic of a torus by constructing a suitable 
Morse function. 

3) Show that the Euler characteristic of any compact odd-dimensional 
differentiable manifold is zero. 
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Show that any smooth function / : S" — > R always has an even 
number of critical points, provided all of them are nondegenerate. 

Prove the following theorem of Reeb: 

Let M be a compact differentiable manifold, and let / £ C 3 (M, R) 
have precisely two critical points, both of them nondegenerate. Then 
M is homeomorphic to the sphere S n (n = dim M). 

Is it possible, for any compact differentiable manifold M, to find a 
smooth function / : M — » R with only nondegenerate critical points, 
and with p,j = bj for all j (notations of Theorem 5.3.1)? 

(Hint: Consider RP 3 (cf. Chapter 1, Exercise 3 and Chapter 4, Exer- 
cise 5) and use Bochner’s theorem 3.5.1, Poincare duality (Corollary 
2.2.2), and Reeb’s theorem (Exercise 5).) 

State conditions for a complete, but noncompact Riemannian mani- 
fold to contain a nontrivial closed geodesic. (Note that such conditions 
will depend not only on the topology, but also on the metric as is al- 
ready seen for surfaces of revolution in R 3 .) 

Let M be a compact Riemannian manifold, p,q £ M,p ^ q. Show 
that there exist at least two geodesic arcs with endpoints p and q. 

In 6.2.1, assume that / has two relative minima, not necessarily strict 
anymore. Show that again there exists another critical point X 3 of / 
with f{x 3) > max{/(a:i), f(x 2)}. Furthermore, if k = inf max fix) = 

7 er ZS7 

f(x 1) = f(x 2), show that / has infinitely many critical points. 

Prove the following statement: 

Let 7 be a smooth convex closed Jordan curve in the plane R 2 . Show 
that there exists a straight line £ in R 2 (not necessarily through the 
origin, i.e. £ = {ax 1 + bx 2 + c = 0} with fixed coefficients a, 6, c) 
intersecting 7 orthogonally in two points. 

(Hint: 7 bounds a compact set A in R 2 by the Jordan curve theorem. 
For every line £ in R 2 , put 

La{£) := lengthen £). 

Find a nontrivial critical point £0 for La (i.e. La{£q ) > 0) on the set 
of all lines by a saddle point construction. See also J. Jost, X. Li-Jost, 
Calculus of variations, Cambridge Univ. Press, 1998, Chapter 1.3) 

Generalize the result of 10) as follows: 

Let M be diffeomorphic to S' 2 , 7 a smooth closed Jordan curve in 
M. Show that there exists a nontrivial geodesic arc in M meeting 7 
orthogonally at both endpoints. 

(Hint: For the boundary condition, see exercise 1 of Chapter 4.) 
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12) If you know some algebraic topology (relative homotopy groups and a 
suitable extension of Lemma 6.11.3, see E. Spanier, Algebraic topol- 
ogy, McGraw Hill (1966)), you should be able to show the following 
generalization of 11). 

Let Mo be a compact (differentiable) submanifold of the compact 
Riemannian manifold M. Show that there exists a nontrivial geodesic 
arc in M meeting Mq orthogonally at both end points. 

13) For p > 1 and a smooth curve c(t) in M, define 

E p (c) :=±j \\crdt. 

Define more generally a space of curves with finite value 

of E p . What are the critical points of E p (derive the Euler-Lagrange 
equations)? If M is compact, does E p satisfy the Palais-Smale condi- 
tion? 
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7.1 The Ginzburg-Landau Functional 

A prototypical situation for the functionals that we are going to consider is 
the following: 

M is a compact Riemannian manifold, E a complex vector bundle over 
M, i.e. a vector bundle with fiber C", equipped with a Hermitian metric (•, •). 
We consider sections p of E and unitary connections Da = d + A (locally) 
on E. Here, “unitary” of course means that A is skew Hermitian w.r.t. (•, •). 
We denote the curvature of Da = d + A by Fa, and we write \p\ for (ip, ip) 2 . 
We consider Lagrangians of the type 

£(<P,A):= J('yi\F A \ 2 + j2\D A <p\ 2 + 'Y 3 V(ip))*(l) (7.1.1) 

M 

Here 71, 72, 73 are positive constants, while V(-) is some “potential”. If V(<p) 
is quadratic in \<p\, e.g. 

V(p) = m 2 \p\ 2 , (7.1.2) 

the resulting Euler-Lagrange equations are linear in p, 

D* A D A p + m 2 p = 0. (7.1.3) 

The Euler-Lagrange equations also contain a equation for variations of A, 
namely 

71 D* a F a = 2 ((p,D A p) + (D A <p, <p)) (7-1.4) 

(see also the proof of Lemma 7.1.1 below for the derivation of these equa- 
tions). 

It leads to a richer structure, however, if we allow V (p) to be a polynomial 
of higher than quadratic order in \p\ . Of particular interest to us will be the 
case of a 4 th order polynomial, for example 

V(p) = (cr- M 2 ) 2 , 

for some cr G R. 

We first consider the case where the base manifold is a compact Riemann 
surface E equipped with a conformal metric, and where the vector bundle is 
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a Hermitian line bundle L , i.e. with fiber C, and a Hermitian metric (•, •) on 
the fibers. 

Definition 7.1.1 The Ginzburg-Landau functional for a section y of L and 
a unitary connection Da = D + A on L is defined as 

£(<P, A) := J (W + \D A y\ 2 + \ (a - M 2 ) 2 ) *(1) (7.1.5) 

E 

( a € R). 

The reason for the factor | will emerge in a moment. A simple calculation 
yields 

Lemma 7.1.1 The Euler- Lagrange equations for the Ginzburg-Landau func- 
tional are 

D*a d aT=\{^- W\ 2 )t (7.1.6) 

D* a F a = -R e(D A <p, <p) (7.1.7) 


Proof. The term J\Fa\ 2 was handled already in §3.2 when we derived the 
Yang-Mills equation. Varying 


(DaT,F>ap) 


(7.1.8) 


w.r.t. A yields 

j t J(D A +tBT, D A +tBV)\ t = o = J ((L> at, By) + {By, D A y )) . 

Thus (7.1.7) readily follows (cf. also (7.1.4) above). Varying (7.1.8) w.r.t. y 
yields 

4 f{D A {tp + tip),D A {y + tip)) = j {{D* A D A y,ip) + (if, D* A D A y)) . 

dtJ h = 0 J 

Finally, the r.h.s. of (7.1.6) obviously arises from varying 

f\( a - \f\ 2 Y 

w.r.t. y. □ 


Remark. (7.1.7) is linear in A. Namely, as explained in §3.2 (cf. (3.2.24)), 
for an abelian structure group, D A F A becomes d*F A , and so (7.1.7) is 

d* ( dA °’ 1 - dA 1 ’ 0 ) = —Re((d + A)y, y) 
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(in the notations of (7.1.13) below) which is obviously linear in A (but not 
in ip). 

Since Da is a unitary connection, A is a 1-form with values in u(l), 
the Lie algebra of U(l). This Lie algebra will sometimes be identified with 
(U(l) is a subgroup of the Lie group G1(1,C), and u(l) is a subalgebra 
of the Lie algebra gI(l,C). The latter can be identified with C. Likewise, 
Gl(l, C) can be identified with C*, the nonvanishing complex numbers, and 
U(l) then corresponds to to the complex numbers of the form e z1? , d £ R. 
Taking derivatives, u(l) then corresponds to the complex numbers of the form 
it, t £ R.) Thus, A , A 1,0 A 0-1 , and the curvature Fa will then be considered 
as imaginary valued forms. This will explain certain factors i appearing in 
the sequel. 

We should point out that the convention adopted here (which is a conse- 
quence of more general conventions used in other places in the present book) 
is different from the convention employed in the physics literature, where one 
writes a unitary connection as 

d — iA 

with a real valued A. In other words, our A corresponds to —iA in the physics 
literature. 

We decompose Q 1 , the space of 1-forms on E, as 

Q 1 = l? 1,0 $ i? 0,1 , (7.1.10) 

with l? 1,0 spanned by 1-forms of the type dz, l? 0,1 by 1-forms of the type dz. 
Here, z of course is a local conformal parameter on E, and with z = x + iy, 
we have z = x — iy. From the beginning of §8.2, we recall the conventions 

dz = dx + idy dz = dx — idy 

d_ = i(d__ i d_\ a_ = i(d_ +i a\ 

dz 2 y dx 1 dy ) dz 2 \dx ^ 1 dy ) ' 

If dx ’ dy are an orthonormal basis of the tangent space of E at the point 
under consideration, we get 

(dz, dz) = (dx + idy, dx + idy) = (dx, dx) + i(dy, dx) — i(dx, dy) + (dy, dy) = 2 
(dz, dz) = 2, (dz, dz) = 0. (7.1.11) 

The last relation in (7.1.11) implies that (7.1.10) is an orthogonal decompo- 
sition. We may also decompose Da into its (1,0) and (0,1) parts 

Da = 9a + 9 a- 


Thus 


&aP £ fi 1,0 (L), 9 aP £ 12 0,1 (L) for all sections ip of L. (7.1.12) 


We also write 
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d A = d+A 1 ’ 0 , d A = d+ A 0 ' 1 , 


(7.1.13) 


with 

d= d+d 

being the decomposition of the exterior derivative. Here we have 
df — df 

df = Tr~dz, df = -rr-dz, for functions on £. 

J dz J dz 

We write the conformal metric g on £ in our local coordinates as 

p 2 (z)dzdz 


Given zq € £, we may assume that 

P 2 (z 0 ) = 1, (7.1.14) 

simply by replacing our coordinates z by z. We may then describe the 
action of the * operator of the metric p 2 dzdz at z o as follows 


We also recall 

hence 

and 


*dz = *(dx + idy) = dy — idx = — idz 
*d~z = idz 

dz A dz = —2 idx A dy, 

*(dz A dz) = —2 i*(dx A dy) = —2 i 
*(1) = dx Ady — -dz A dz. 


(7.1.15) 

(7.1.16) 

(7.1.17) 

(7.1.18) 


We compute 

d A d A p ={d + A 1 ’ 0 ) o(d + H 1 ’ 0 ) <p 

= ddip + A 1 ’ 0 A dip + A 1 ’ 0 A A lfi <p + ( dA h0 ) tp - A 1 ' 0 A dtp 
= 0 (7.1.20) 


since dd = 0 and A 10 A A 1,0 + dA 1,0 is a (2,0) form which has to vanish as 
£ has complex dimension 1. 

Likewise 

dd= 0 (7.1.21) 

Moreover 

d A d A ip = ddp + A lfi A dtp + A 10 A A 0,l <p + (9H 0 ’ 1 ) p - A 0 ' 1 A dp 

d A d A p = ddp + A 0 ' 1 A dp + A 0 ’ 1 A A lfi p + (9H 1 ’ 0 ) p - A 1 ' 0 A dp 

= -d A d A p + (SH 1 ’ 0 - dA 0 ’ 1 ) p (7.1.22) 

= —d A d A p - F A p 
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i.e. 

FA = -(d A dA + dAd A ) (7.1.23) 

Theorem 7.1.1 We have 

A) = J ^2|9a^| 2 + ~\( a ~ M 2 )) ^ *(!) + 27rcrdegL 

(7.1.24) 

with 

degF •= ci(L)[E\ (the degree of the line bundle L). 


Proof. We compute (writing F in place of Fa) 

j(*(-iF) - 1 (a- |<p| 2 )^ *(1) 

= J '(\F\ 2 + ^ (c - |^| 2 ) 2 - cr*iF * iF(<p, ifi)^ *(1) 


Now 

Also 


j*iF*(l) = JiF = 2ttci (L)[E\ = 27rdeg L 


(7.1.25) 

(7.1.26) 


f(*iFip,<p)*( 1) = j(-i(d A d A + d A d A )ip,*ip)*( 1), using (7.1.23) 


In order to proceed, let zq G A, and choose Riemannian normal coordinates 
with center Zq. Thus, p 2 (zq) = 1, and the first derivatives of the metric vanish 
at Zq . Also, we apply a gauge transformation so that A(zq) = 0 (see Lemma 
3.2.3). Since we are not going to commute any derivatives any more, no second 
derivatives of the metric or first derivatives of A will enter our subsequent 
computations at Zq, and we may therefore proceed with our computations as 
in the Euclidean case. Thus, we have to evaluate 

—i {{<Pz) x dz A dz + (ip z )-dz A dz ) , -pdz A dz ^ * (1) 

= ^ y 2 a<pz) z ■ p - (<fzh ■ <p) * (i) 

(since (— idz A dz,idz A dz) = — \dz A dz \ 2 = —4 as (-, ■) is Hermitian) 

= 2 J (WzVz - WzPz) * ( 1 ) 

= - J (| 3 a < p | 2 - | 9 a <^| 2 ) *( 1 ) 

(the factor 2 disappears since (dz, dz) = (dz, dz) = 2, and in our coordinates 
d<p = <p z dz etc.). Thus we have shown 
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- J(*iFip,<p)*( 1) = 

J(\d A p\ 2 - \d A g> *( 1 ) 

(7.1 

.27) 

Finally, of course 




\dm 2 = 

\d A ipf + \d A p\~ 

(7.1 

.28) 

since the decomposition 




n 1 = 

f2 1,0 © I? 0 ’ 1 




is orthogonal. The result then follows from (7.1.25) - (7.1.28). □ 

Theorem 7.1.1 has the following useful consequence 

Corollary 7.1.1 Assume deg L > 0. Then the lowest possible value permitted 
by the global topology of the bundle for £(<p, A) is realised precisely if <p and 
A satisfy the set of I s ^ order differential equations 

d A tp = 0 (7.1.29) 

*(iF) = ±(a- M 2 ) (7.1.30) 

□ 


Remark. If degL < 0, then these equations cannot have any solution, because 
for any solution, £(ip,A) would be negative by (7.1.24) whereas we see from 
(7.1.5) that for any <p,A, £((p,A) > 0. Thus, in case degL < 0, one has to 
consider the selfduality equations arising from the following expression for 
the Ginzburg-Landau functional: 

a ) = J (^\d A p>\ 2 + ^ *(-iF ) - ^(<7 - M 2 ) j j *(1) - 27rdegL 

(7.1.31) 

which is derived through the same computations. W.l.o.g., we shall assume 
degL > 0 in the sequel. 


Integrating (7.1.30) yields the inequality 

27rdegL = JiF = ^ J(a - M 2 )*(l) < | Area (£) 

with ^Area(A') = f *(1)^. Thus, a necessary condition for the solvability of 

47rdegL 


V 

(7.1.30) is 


(j > 


Area(A) ’ 

and in fact, we must have strict inequality in (7.1.32) unless ip = 0. 


(7.1.32) 
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Corollary 7.1.1 constitutes another instance of the phenomenon of self- 
duality that we already encountered in §3.2 when we discussed the Yang- 
Mills functional on a fourdimensional Riemannian manifold. The equations 
(7.1.29), (7.1.30) are also called selfduality equations because the solutions 
of these first order equations are precisely those solutions of (7.1.6), (7.1.7) 
that realize the lower bound imposed by the topology for the functional and, 
if they exist, yield the absolute minima for the functional considered. In 
fact, this remark, namely that these equations hold for the absolute min- 
ima, makes it clear that any solution of (7.1.29), (7.1.30) automatically also 
solves (7.1.6), (7.1.7), as the latter are the Euler-Lagrange equations for the 
Ginzburg-Landau functional, and as such have to be satisfied in particular by 
minimizers of that functional. Of course, it may also be checked by a direct 
computation that solutions of (7.1.29), (7.1.30) also solve (7.1.6), (7.1.7). 

The selfduality may be generalized as follows. Instead of C(ip, A ), we con- 
sider for e > 0 

UV>,A) := J^e 2 \F A \ 2 + \D A ip\* + - M 2 ) 2 } *(1) (7.1.33) 

= J^\9a^\ 2 + (e*(iF) - ^(<7 - M 2 )) | *(1) + 27rdegL 

which leads to the selfduality equations 

d A y = 0 (7.1.34) 

e 2 *(iF) = ^(o r-M 2 ). (7.1.35) 

Still more generally, in place of e, one may consider a function f(z) on S, for 
example f , . This leads to the functional 

(7 - L36) 

= J{j^\F a (z)\ 2 + \DMz)\ 2 + - M 2 ) V*)| 2 } *« 

= /{ 2 |(W| 2 + (|^7y|*( iF ) - M 2 )M^ | *(1) +27rdegL 

with the selfduality equations 

d A <p = 0 (7.1.37) 

e 2 *(LF)= l(a-M 2 )M 2 - (7-1.38) 

The functionals C e and C e are quite important for studying phase tran- 
sitions in superconductivity. 
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For studying solutions, the following consequence of the maximum prin- 
ciple is very useful 

Lemma 7.1.2 Let £ be a compact Riemann surface with a conformal metric, 
L as before. For any solution of (7.1.6), hence in particular for any solution 
of (7.1.29), we have 

\tp\ < a on £ (7.1.39) 


Proof. From (7.1.6), we obtain 

= (D* A D A tp, ip) - {D A ip,D A ip) 

(cf.(3.2.7)) = U a ~ M 2 )M 2 - \D A ip\ 2 . 

Let Zq £ £ be a point where \p\ 2 achieves its maximum. We may assume 
A = 0 at z 0 (cf. Lemma 3.2.3), hence D A ip = 0 at zq. If we had |<^(z 0 )| > cr, 
then at zq 

A\ip\ 2 < 0 


which contradicts the maximum principle. 


□ 


Perspectives. It was shown by Taubes[237] that on K 2 , one may solve the 
Ginzburg-Landau equations with any given finite collection prescribed as zero set 
for ip, with prescribed multiplicities. This result was extended to compact Riemann 
surfaces by Bradlow and Garcfa-Prada, and these authors also found generaliza- 
tions on higher dimensional Kahler manifolds. References include [29, 30], [84, 85, 
86]. We should also mention Hitchin’s penetrating study [120] of the equations 

d A p = 0 
Fa + W>, P*} = 0 


on a compact Riemann surface. 

The limit analysis for e — > 0 of the functional C t (p, A) and the solutions of the 
equations (7.1.34), (7.1.35) on a compact Riemann surface has been carried out by 
Hong, Jost, Struwe[122[. The result is that away from the prescribed zero set of 
p e (the “vortices”), \<p t \ uniformly converges to 1, and D A<s p e and dA e uniformly 
converge to 0, whereas the curvature in the limit becomes a sum of delta distribu- 
tions concentrated at the vortices. Of course, the number of vortices counted with 
multiplicity has to equal the degree of the line bundle L, deg L. This result thus 
yields a method for degenerating a line bundle on a Riemann surface into a flat 
line bundle with degL singular points (counted with multiplicity) and a covariantly 
constant section. 

Results for the p 6 theory on a compact torus can be found in Caffarelli, 
Yang[35], Tarantello[236], Ding, Jost, Li, Wang[58]. For the case of S' 2 , see Ding, 
Jost, Li, Wang[59]. The general case was solved by Ding, Jost, Li, Peng, Wang[57[. 
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7.2 The Seiberg-Witten Functional 

Let M be a compact, oriented, four dimensional Riemannian manifold with 
a spin c structure P c , i.e. a spin c manifold. (As mentioned in §1.8, in the four 
dimensional case, there always exists some spin c structure on a given oriented 
Riemannian manifold.) As in Definition 1.8.10, the determinant line bundle 
of this spin c structure will be denoted by L, and as in Definition 3.4.1 (ii), the 
Dirac operator determined by a unitary connection A on L will be denoted by 
V A . Finally, we recall the half spin bundle S 41 defined by the spin c structure, 
as remarked after Definition 1.8.10 (we omit the subscript for the dimension, 
as the dimension is fixed to be 4 in the present §). By Lemma 3.4.5, T> a maps 
sections of S ± to sections of S T . 

Definition 7.2.1 The Seiberg-Witten functional for a unitary connection A 
on L and a section ip of S + is 

SW(p,A) := J (jV^| 2 + |F+| 2 + ^M 2 + *(1) (72.1) 

M 

where is the spin c connection induced by A and the Levi-Civita connec- 
tion of M (cf. 3.4.7), is the selfdual part of the curvature of A , and R is 
the scalar curvature of M. 

The discussion of the Seiberg-Witten functional will parallel our discus- 
sion of the Ginzburg-Landau functional in §7.1. In fact, the structure of SW 
is quite similar to the one of C , containing a square norm of the curvature 
of the connection A, the square of the norm of the covariant derivation of ip, 
and a nonlinearity that is a fourth order polynomial in \ip\. 

Lemma 7.2.1 The Euler- Lagrange equations for the Seiberg-Witten func- 
tional are 

V* a Va<P=-(j + (7.2.2) 

d*F+ = — R e(V A <p, ip) (7.2.3) 

Proof. As the proof of Lemma 7.1.1. □ 

In order to proceed, we need to associate to s € Sf the two-form r(s) 
defined by 

w) := (v ■ w ■ s, s) + ( v , ic)|s| 2 


Lemma 7.2.2 
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t(s) € rl 2,+ (fK) (i.e. t(s) is a selfdual 2-form that 

assumes imaginary values), 

and 

l T ( s )| 2 = 2|s| 4 . 

Proof. We first show that r(s) takes imaginary values. We start with the skew 
symmetry. 

t(s)(v, w ) = (v ■ w ■ s, s) + (v, tu)|s| 2 = ((—w ■ v — 2(v,w))s, s) + (v,w)\s\ 2 
= — t(s)(w , v) 

Next 

w) = (v ■ w ■ s, s) + ( v , it?) |s| 2 = (s, v ■ w ■ s) + ( v , tu)|s| 2 
= —(v ■ s,w ■ s) + (t?,w)|s| 2 by Corollary 1.8.4 
= (w ■ v ■ s,s) + ( v , w)|s| 2 for the same reason 

= t(s)(w, v ) 

= —t(s)(v,w) by skew symmetry 

This implies that t(s)(v,w ) is in ?'R. 

For the computation of |r(s)| 2 , we recall that the spin representation 
r : C1 C (R 4 ) — > C 4x4 , and the half spin representation that we shall now 
denote as T + : Cl c,ei; (]R 4 ) — > = C 2 . We write s = (s^s 2 ) € C 2 and 

obtain from the formulae for r(e 0 ,e^) from §1.8 

r(s)(e i,e 2 ) = i(sV + s¥) = r(s)(e 3 ,e 4 ) 

T(s)(ei,e 3 ) = s 1 s 2 _- sV = -r(s)(e 2 ,e 4 ) 
r(s)(ei,e 4 ) = *(s 1 s 1 - s 2 s 2 ) =r(s)(e 2 ,e 3 ) 

This already implies that r £ A 2 ’ + . 

We may now compute 

kO) I 2 = l T ( s )( e i’ e i)l 2 

i<j 

= 2 ((s 1 ? - S 2 S 2) 2 + + s V) 2 _ (s 1^2 _ s 2 ?) 2 ) 

= 2\s\\ 

□ 


In more explicit terms we may write 

r(s) = (ej ■ efc • s, s)e J A e k 

where e- 7 is a frame in T* M dual to the frame Cj on TM (j = 1, ... ,4). 
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Theorem 7.2.1 The Seiberg -Witten functional (7.2.1) can be expressed as 
SW { p,A) = j(^V A p\ 2 + FX~\(e r e k .p,p)e^e k j *(1) (7.2.4) 

where e 3 , j = 1, . . . , 4, are 1 -forms dual to the tangent vectors ej, j = 1, ... ,4, 
i.e. e 3 (e ^ ) 

Proof. We have 

I Fa - \( e o ' e k ' T,V>)e 3 Ae*j 2 = |F+| 2 + ^(e,- • e k ■ p,p)e 3 Ae fc | 2 

“ \( F A’ eP Aek )( e i ■ e k ■ <P,<P) (7.2.5) 

By Lemma 7.2.2 

r^\{ej ■ e k ■ <p,<py Ae k \ 2 = ^M 4 (7.2.6) 

Writing F X = F(f e' A e 1 , we get 

-\( F Xi e3 Ke k )(ej-e k -p,p) = -^{F^e, ■ e k - <p,<p) = ~^(FXp,p) (7.2.7) 

On the other hand, the Weizenbock formula of Theorem 3.4.2 yields, (apply- 
ing (3.4.19) to ip, taking the scalar product with p, integrating, and using 


the self adjointness of T> a) that 

J \V A p\ 2 = J \S/ A p\ 2 + \R\p\ 2 + \{F+p, p) (7.2.8) 

The result follows from (7.2.5) - (7.2.8). □ 

Corollary 7.2.1 The lowest topologically possible value of the Seiberg-Witten 
functional is achieved precisely if p and A are solutions of 

V A p = 0 (7.2.9) 

F a = • e k ■ p, p)e 3 A e k (7.2.10) 


Definition 7.2.2 The equations (7.2.9) and (7.2.10) are called the Seiberg- 
Witten equations. 

Thus, we see the mechanism of selfduality at work once more. The absolute 
minima of the Seiberg-Witten functional for which the above lower bound is 
achieved satisfy not only the the second order equations (7.2.2), (7.2.3), but 
also the first order Seiberg-Witten equations (7.2.9), (7.2.10). 



396 7. Variational Problems from Quantum Field Theory 


So far our discussion of the Seiberg-Witten functional has been completely 
analogous to the one of the Ginzburg-Landau functional, except that so far, 
the parameter a in the latter has had no analogue in the former. However, this 
can easily be achieved by choosing a 2-form /i and considering the perturbed 
functional 

SW^ip, A) = J (\V A p\ 2 + I Fa ~ \ ( e i ' e k ' V,v)e 3 A e k + p\ 2 ^j *(1) 

= / (|V^| 2 + \F+\ 2 + *\p\ 2 + (7.2.11) 

\V~ \(ej ' e k ■ P,<p}e J Ae fc | 2 + 2(F|,/x)^*(l) 

If we assume that /u, is antiselfdual, then 

(Ft,n) = 0 (7.2.12) 

as F~ A [ by definition is selfdual and the decomposition of the 2- forms on a four 
dimensional manifold into selfdual and antiselfdual ones is orthogonal (see 
§3.2). Thus, in that case the additional term in (7.2.11) disappears. 

If we assume that /r is a closed selfdual form, then 

( f a . = 0) 

again since the antiselfdual form FJ is orthogonal to the selfdual forms, and 
hence 

(FaiV) = ( F A,fj)- 

Further, since Fa represents the first Chern class ci(L) of the determinant 
line bundle L (see §3.2), and since /i is assumed to be closed, hence represents 
a cohomology class [/x], 

f(F A ,n)*( 1) (7-2.13) 

M 

does not depend on the connection A (see the discussion of Chern classes in 
§3.2), hence represents a topological invariant, denoted by (ci(L) A [/i] ) [M] . 
This expression then plays a role that is completely analogous that one of 
27rdegL in the discussion of the Ginzburg-Landau functional. 

The corresponding first order equations for SW^ are 

V A <p = 0 (7.2.14) 

f a = |( e i • e/t • ip, ip)e 3 Ae fc -/i (7.2.15) 

Since, by our conventions, both F + and (ej -ek-^p, p)e? A e k are imaginary 
valued, (7.2.15) may only admit a solution if we assume that /i is imaginary 
valued as well. As in the Ginzburg-Landau theory, one may also introduce 
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a scaling factor e > 0 or a scaling function like A into the Seiberg-Witten 
functional. For example, one may define 


SWn >e (<p,A) = 1 1 |V^c ^| 2 + e 2 |F +| 2 + ||^| 2 

M ^ 


(7.2.16) 


+ 4 I V- (ej ■ e k ■ W,y)e 3 A e fe | 2 + 2 (F|,/x) U(l) 


— / % I^av 3 ! 2 + 


M 


1 /I 


eFJ - - ( -(e, • e fe • ip, p)e J A e k - p 


K 1 )- 


We have a maximum principle similar to Lemma 7.1.2: 


Lemma 7.2.2 For any solution p of (7.2.2), hence in particular for any 
solution of (7.2.9), on a compact four- dimensional Riemannian manifold, we 
have 

max|t/?| 2 < max(— R(x), 0) (7.2.17) 

m xe m 


Proof. (7.2.2) implies 

^M 2 = (V* A V A <P, p) - |Vav?| 2 (c/.(3.2.7)) 

= -(f + ^M 2 ) M 2 -|V^| 2 . 

Let Xo £ M be a point where | V^ 4 <^| 2 achieves its maximum. Then 

A\p(x 0 )\ 2 > 0. 


Thus, 


and (7.2.17) follows. 


R{x 0 ) + \p(x 0 )\ 2 < 0, 


□ 


Corollary 7.2.2 If the compact, oriented, Riemannian Spin c manifold M 
has nonnegative scalar curvature, then the only possible solution of the 
Seiberg-Witten equations is 
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Proof. By Corollary 7.2.1, solutions of the Seiberg-Witten equations (7.2.9), 
(7.2.10) also solve (7.2.2), (7.2.3). From Lemma 7.2.2 we conclude that in 
case R > 0, the only solution of (7.2.2) is 

ip = 0. 


(7.2.10) then yields = 0. □ 

In fact, the conclusion of Corollary 7.2.2 may also be obtained directly 
from Theorem 7.2.1 as follows: From (7.2.4) is clear that for any solution of 
(7.2.9), (7.2.10), we have SW(ip 7 A) = 0. If R > 0, (7.2.1) on the other hand 
implies that SW(ip , A) = 0 can only hold if all terms in the integral in (7.2.1) 
vanish. Hence tp = 0, = 0. 

Perspectives. The Seiberg-Witten equations were introduced by Seiberg and 
Witten[223, 224], The mathematical relevance of these equations was first shown 
by Witten[257], Taubes[238, 239], Kronheimer and Mrowka[164]. Further references 
can be found in the monographs of Salamon[212] and Morgan[187]. The equations 
and their applications are also described in several survey articles, among which 
we mention Friedrich[80] (see also [81]). All these references have been useful in 
assembling the material presented here. 

As in the case of other gauge theories like the Yang-Mills theory discussed in 
§3.2, the functional and the equations are invariant under the action of a gauge 
group. Here the structure group is U(l), and so the Gauge group Q consists of 
maps from M into U(l) = S 1 , u £ Q acts on a pair (pp, A) via 

u*(ip, A) = (u~ 1 ip,u~ 1 du + A). 


One has 


T> u *a(u 1 p) = u 1 VaP> 


and 


F u *a = Fa 


so that the functional and the equations (including the perturbed ones) remain 
invariant under the action of Q. For a given spin c structure P c , Riemannian metric 
g and imaginary valued selfdual 2 -form p as pertubation, one considers the space 
of solutions of (7.2.14), (7.2.15) modulo the action of Q. This space is called moduli 
space _M(M, P c , g, p) of solutions. One writes the second Betti number 62 of M as 


b 2 =b + + b~ 


where b + ( b ) is the dimension of the subspace of H 2 (M,R) represented by 
(anti)selfdual 2-forms. In Seiberg-Witten theory, it is shown that in case b + > 0, 
the moduli spaces M(M, P c , g, p) are finite dimensional, smooth, compact, oriented 
manifolds, at least for “generic” p. The compactness here comes from the fact that 
solutions satisfy uniform estimates. (Lemma 7.2.2 and estimates for higher deriva- 
tives, see e.g. Jost, Peng, Wang[143] for a general presentation) that imply conver- 
gence of subsequences of families of solutions. This is different from the situation 
in Donaldson’s theory of (anti)selfdual connections on SU(2) bundles where no uni- 
form estimates hold. The most useful case seems to be where the moduli space is 
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zerodimensional, i.e. where one has a finite number of solutions. The theorem of 
Seiberg-Witten says that if b + > 1 and b + — b~ is odd, then the number of solutions 
counted with orientation is independent of the choice of the Riemannian metric g 
and the pertubation g and depends only on the spin c structure P c on M . Also, 
these moduli spaces are nonempty only for finitely many spin c structures. If (M, g ) 
in addition has positive scalar curvature, then in fact all Seiberg-Witten invariants 
vanish (cf. Corollary 7.2.2). On the other hand, such Seiberg-Witten invariants, i.e. 
numbers of solutions counted with orientation, can often be computed from gen- 
eral index theorems, i.e. from topological data alone, and when these numbers are 
found to be nonzero, this yields an obstruction for certain compact, oriented, dif- 
ferentiable 4-manifolds to carry metrics with positive scalar curvature. For results 
based on such ideas, see e.g. Le Brun[32]. The Seiberg-Witten theory can be used to 
prove, to reprove and to extend many results from Donaldson theory. Kronheimer- 
Mrowka[164] and Morgan, Szabo, Taubes[188] used Seiberg-Witten theory to prove 
the Thom conjecture, stating that smooth algebraic curves (i.e. compact complex 
smooth subvarietes of complex dimension one) in CP 1 2 minimize the genus in their 
homology classes. 

The Seiberg-Witten equations seem to be particulary useful on symplectic 4- 
manifolds ( M,u > ). Using ito as a perturbation and using the limit e — > 0 for the 
parameter e introduced into the equations above (see (7.2.16)), Taubes[240, 241] 
showed that in the limit the zero set of the solution is a collection of pseudoholo- 
morphic curves in the sense of Gromov[99]. Also, the curvature Fa will concentrate 
along the pseudoholomorphic curves in the limit e — > 0. In this way, one may identify 
the invariants defined by Gromov that are very useful in symplectic geometry, but 
hard to compute, with the invariants of Seiberg-Witten that can typically be com- 
puted from topological index theorems. For a generalization of the Seiberg-Witten 
functional with a potential term of 6 th order, see Ding, Jost, Li, Peng, Wang[57]. 


Exercises for Chapter 7 

1) Show by a direct computation that (7.1.29), (7.1.30) imply (7.1.6), 
(71-7). 

2) Derive the Euler-Lagrange equations for the functional defined in 
(7.2.16). 
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8.1 Definitions 

We let M and N be Riemannian manifolds of dimension m and n, resp. 
If we use local coordinates, the metric tensor of M will be written as 

(0fx/3)a,/3=l,...,m, 


and the one of N as 


We shall also use the following notations 

(7 a/3 )a,/3=i,...,m = (inverse metric tensor) 

7 := det(7 a/3 ) 


r 0r, ■■= ^°‘ S h0S,r l + 'yr l S,0 


l 0 r!,s) (Christoffel symbols of M) 


and similarly 

n ij n 
9 ? jk’ 

If / : M — » is a map of class C 1 , we define its energy density as 


'(/>(*) 


( 8 . 1 . 1 ) 


in local coordinates (x 1 , . . . , x m ) on M, (f 1 , . . . , /") on N. 

The value of e(f)(x) seems to depend on the choices of local coordinates; 
we are now going to interpret e(/) intrinsically and see that this is not so. 
For this purpose, we consider the differential of /, 


df 


df_ 

dx a 


dx a ® 


d 

W v 


a section of the bundle T*M ® f~ 1 TN. 

f~ 1 TN is a bundle over M with metric ( Qij{f(x ))), while T*M of course 
has metric (7 a/3 (a :)), cf. (1.5.5). Likewise, we have for the Levi-Civita con- 
nections: 
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d d . 

v - ! ;-np v ^ or by the cham rule 

dx a l3 df k 

V a dx^ = —T^dx 1 cf. (3.1.20), which follows from 


(8.1.2) 


= S/H, 0 = ^(dx»(-E)) 
= 


(8.1.3) 


5 1 
9 7; ) 

55^ 9a; 7 


We shall also employ the convention that the metric of a vector bundle 
E over M will be denoted as 

(■> ')e- 

Then, with 


e(f) _ I a/3/^L JT) 1 

eUj_ 2 7 { dx«’ dxd } f- 1TN 
= ^(dfidf) T * M ®f-i TN - 


(8.1.4) 


f-'TN I s th e pullback by / of the metric tensor of N, and conse- 
quently e(f) is its trace (up to the factor |) w.r.t. the metric on T*M. We 
may also express (8.1.4) as 


e(/)= 2 1 


(8.1.5) 


where the norm || • || involves the metrics on T*M and f 1 TN. 


Definition 8.1.1 The energy of a C^-map / : M — > N is 

E{f ) := J e(f)dM (8.1.6) 

M 

(with dM = pydx 1 A ... A dx m in local coordinates, being the volume form 
of M). 


Of course, E generalizes the energy of a curve in N, i.e. a map from, say, 
S 1 to N as considered in chapter 7 and earlier. 

Another, even simpler special case is where N = R. We then have the 
Dirichlet integral of a function / : M — > R. 
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M 


Our aim in this chapter is to find critical points of E. These will then 
be higher dimensional generalizations of closed geodesics on N. One can also 
consider them as nonlinear analogues of harmonic functions on M. 

Lemma 8.1.1 The Euler- Lagrange equations for E are 


1 d 


7! ^ + = o- <8-1.7) 


d 


v «/3 


d 


d 


Definition 8.1.2 Solutions of (8.1.7) are called harmonic maps. 

Remark. If M = S 1 with its metric in standard coordinates, (8.1.7) reduces 
to the familiar equation for geodesics. 

Proof. Let / be a smooth critical point of E. Then / is in particular con- 
tinuous, and we may localize our computations in local coordinates in both 
domain and image. In this sense, let a smooth ip be given in such local coordi- 
nates, with compact support, and consider the variation f+tp for sufficiently 
small |t|, the sum being taken again in local coordinates. As / is a critical 
point of E, 

j f E(f + tp) , t=o = 0 (8.1.8) 

So far, in fact, it sufficed to suppose / to be of class C 1 . We now assume / 
to be of class C 2 so that the equations (8.1.7) are meaningful. (8.1.8) gives 


° = Jt\j +M X )) 

M 




' dx a v dxP dx 13 

df <V 


= J (7 ap {x)gij{f{x)) 


dx a dx@ 


M 


making use of the symmetry gij = gji 

= - f (>/rr ...dx m 


M 


- / T a P( x )^^9ij,k{f( x ))<P j VTdx 1 ...dx 


M 


J lT a0 ( x )9ijAf( x ))^f^T k Vi dxl •••«**" 

M 
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where we may integrate by parts since has compact support in M. We put 
Tji = gijtpi , and thus ^ = g lJ r)i. We then obtain 

M 

~ J \l “V- 7 (. 9ij,k + 9kj,i - 9ik,j ) ^ . . . dx m (8.1.9) 

M 

using the symmetry 7“^ = 7^“ in the second integral above. 

The claim then follows from Theorem A. 1.5. □ 


Later on, the smoothness of critical points of E will be an important and 
often difficult issue. For the moment, however, rather than discussing this 
question further, we want to interpret (8.1.7) from an intrinsic point of view. 

We let tp be a vector field along /; this just means that tp is & section of 
f~ 1 TN. In local coordinates 

tp induces a variation of / by 


ft(x) ■= ex P j (x) 


We want to compute 




As an auxiliary computation 


# = v r a lTN (r^-y^dx a 

a aj l 

d , ib i d u d d 

= ^w>® dxa+rr ^w® dxa ' 


( 8 . 1 . 10 ) 


( 8 . 1 . 11 ) 


writing J^f^as above. 

We now also have to take derivatives w.r.t. t. Here is a vector tangent 
to Mxl. The Levi-Civita connection on M and the trivial connection on R. 
yield the Levi-Civita connection on T*(M xl)® f~ 1 TN. Moreover, instead 

of a , = (/*V JV ) a , we shall simply write V a . 

“■t (.55*7 ex* 


Vid/i = 

at at Qx a 


d 


= 

dx ' 


df 

d 

dt df l 
and V is torsion free 


<g> dx a (cf. (8.1.2)) 


( 8 . 1 . 12 ) 




since and commute 


= dtp (cf. (8.1.11)) 
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d 

dt " 


since the derivative of exp p at 0 € T p M is the identity ((1.4.10)). Then 

|t=o = \J f t (df t ,df t )dM lt=0 

M 

= J (df , V a_dft)dM\t—o 

M 

= J (df, dip)dM by (8.1.12) 

M 

= J (df,\7_^(^)^dx a )dM (by (8.1.11)) 

M 

= — / (V d df , if 0 dx a ) dM since V is metric 
J ^ 

M 

= —J (tra ce\7df,ip)dM. (8.1.13) 


M 


Thus, intrinsically, the Euler-Lagrange equations for E are 

r(/) := trace Vd/ = 0. 


(8.1.14) 


t is called the tension field of /. 

Let us also check directly that (8.1.7) and (8.1.14) are equivalent: 

We let V denote the Levi-Civita connection in T*M (g) f~ x TN as before. 


V a (d/) = V a (^f-dx a ^~) 
51 ? w ’ 51 ? v dx a df 1 ' 


T * M dx a ) — 9 
' ’dx^dp 


d df 1 d 

51 ? df 1 dx a 

d 2 d d d 

dx a dxd df 1 dx a df 1 

r , d dp df d y> 

10 df k dx@ dx a 

We then obtain for the components of r(/) = trace V df 


(8.1.15) 


‘(/) = 7' 


= V*/ 3 . 


d 2 p 

dx a dxP 


pi dff 

a(3 Q xl 


*/3 pi d P d f 


dx a dxP 


(8.1.16) 


This shows that (8.1.7) and (8.1.14) are indeed equivalent, since one easily 
computes 
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J__ a p_&_ 

yj 7 ) dx a * dxP 


dx a dxP 7 0r ^dx-y ' (8-1-17) 


The operator 




is the negative of the Laplace-Beltrami operator of the Riemannian manifold 
M, cf. (2.1.12). We recall 

A M f = — div grad/, 


with 


grad / = 7 ' 


d 


(2.1.13) 




dx a dx@ 

= (2 - L15) 
is a harmonic function iff 


A M f = 0 . 

Besides closed geodesics and harmonic functions, there is another easy exam- 
ple of a harmonic map. 

The identity map id : M — > M of any Riemannian manifold is harmonic. 
This follows for example from (8.1.16): 

if f(x) = x, then = S i7 , = S ja , = S k0 , and thus r(/) = 0 . 
Also: 

Corollary 8.1.1 An isometric immersion f : M — > N is harmonic if and 
only if it represents a minimal submanifold of N. 

Proof. From (3.6.25). □ 


Perspectives. An intrinsic calculus for operators on vector bundles and harmonic 
maps is developed in [68], . Some older survey articles on harmonic maps are [67, 69] 
and [128], the latter also containing a list of open problems with detailed references. 
Some more recent references will be given in the Perspectives on the subsequent §§. 

In quantum field theory, harmonic maps occur as solutions to the nonlinear a- 
problem. The supersymmetric version of this problem recently inspired an extension 
of the concept of harmonic maps, the so-called Dirac-harmonic maps that couple the 
map with a nonlinear spinor field while preserving the essential structural properties 
of harmonic maps, see [47, 46] . 
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8.2 Twodimensional Harmonic Mappings and 
Holomorphic Quadratic Differentials 


Definition 8.2.1 A Riemann surface is a complex manifold (cf. Definition 
1.1.5) of complex dimension 1. 

Thus, coordinate charts on a Riemann surface £ are given by maps 

P i • Ui * 

Ui open in £, for which the transition functions 

Pj o ip~ x : (pi{Ui n Uj) — > <pj{Ui n Uj ) 

are holomorphic maps between open subsets of C. 

We write coordinates in C as 


z = x + iy . 


For a coordinate transformation w = w(z),w = u + iv, we thus have the 
Cauchy- Riemann equations u x = v y , u y = —v x , and in particular u x u x + 
v x v x = UyUy + VyV y , u x u y + v x v y = 0, and we see that a Riemann surface has 
a conformal structure in the sense of Definition 3.6.6. 

We call z = ip(p) for a local chart ip a local conformal parameter at p € £ 
and define operators 


d_ 

dz 

and 1-forms 
These satisfy 


Id .8 8 18 .8 

2 8x 1 8y dz 2 8x ? 8y 


dz = dx + idy, dz = dx — idy. 


(cf. 6.1) 


*(| ) = 1 = *(|), <k(|) = o = <&(|). 

A map between Riemann surfaces is called holomorphic or antiholomorphic if 
it has this property in local coordinates. This does not depend on the choice 
of local coordinates because all coordinate changes are holomorphic. 


Definition 8.2.2 A Riemannian metric (•,•) on a Riemann surface £ is 
called conformal if in local coordinates it can be written as 

p 2 (z)dz®dz (8.2.1) 

(p(z) a positive, real valued function). 
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This means 


d d d d 

Pz' dz' K dz'dz h 

, d d 2 


If we want to express this in real coordinates, we compute 
dz®dz = dx 0 dx + dy ® dy, 

hence 

( Pb = p2(2) = ( |r |;> = °- 

In the same manner as Theorem 1.4.1 is proved, a partition of unity argument 
gives 


( 8 . 2 . 2 ) 

(8.2.3) 

(8.2.4) 

(8.2.5) 


Lemma 8.2.1 Every Riemann surface admits a conformal metric. □ 

Of course, every conformal metric is Hermitian in the sense of Definition 
5.1.2, and conversely. 


Definition 8.2.3 Let E be a Riemann surface, N a Riemannian manifold 
with metric or gijdf 1 (g> d/ J in local coordinates. A C 1 -map f : E —> N 

is called conformal , if 


d f df, ,df df . 
dx' dx^ N ~ dy 1 dy ^ 


dj_ df 
dx' dy 


)n ~ 0 - 


( 8 . 2 . 6 ) 


In local coordinates this is of course expressed as 


= *,(/(4)f f, = o. (8.2.7) 

For the sequel, it will also be instructive to write this condition in complex 
notation, namely 


0 =( 


df df . 

dz' dz ^ 


9jk(f(z))( 


df j df k 
dx dx 


djpdff_ djpdp 

dy dy ? dx dy 


( 8 . 2 . 8 ) 


Lemma 8.2.2 Holomorphic or antiholomorphic maps between Riemann sur- 
faces are conformal, if the image is equipped with a conformal metric. 

The proof is obvious. □ 

Lemma 8.2.3 Let E be a Riemann surface with a conformal metric A 2 {z). 
Then the Laplace- Beltrami operator is 
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4 d 2 
A 2 (z) dzdz' 


(8.2.9) 


Proof. Direct computation. □ 

Lemma 8.2.4 Let £ be a Riemann surface with conformal metric A 2 (z), N 
a Riemannian manifold with metric tensor (gif). Then a map f : £ — > N of 
class C 2 is harmonic iff 

+ Hk (/(*)) d £%=° fori = 1, • • • , dim N. (8.2.10) 

It is a parametric minimal surface iff it is harmonic and conformal. 

Proof. One checks directly that (8.2.10) is equivalent to (8.1.7). The second 
claim directly follows from the Definition 3.6.7 of a parametric minimal sur- 
face. □ 


Corollary 8.2.1 If £ is a Riemann surface, N a Riemannian manifold, the 
harmonic map equation for maps f : £ — > N is independent of the choice of 
conformal metric on £. Thus, whether a map is harmonic depends only on 
the Riemann surface structure of £, but does not need any conformal metric. 

Proof. The metric of £ does not appear in (8.2.10). □ 

Corollary 8.2.2 Holomorphic or antiholomorphic maps between Riemann 
surfaces are harmonic. 

Proof. Such maps obviously satisfy (8.2.10). □ 

More generally 


Corollary 8.2.3 If k : £\ — > £2 is a holomorphic or antiholomorphic map 
between Riemann surfaces, and f : £2 — > N is harmonic, then so is f o k. 


Proof. Let w be a local conformal parameter on £ 1 . Then, if for example k 
is holomorphic, and in local coordinates k = z(w), we have 


hence 


dz 

dw 


= 0 , 


df o k _ cffdz^ df ok _ df_ 
dw dz dw ’ dw dz dw 


and 
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d 2 f l o k t dp o k df o k = cfff_ t dp_df_ dz_dz_ 

dwdw jt dw dw { dzdz il dz dz’dwdw 1 

and this vanishes if / is harmonic. □ 

Let E, N be as before, A 2 (z)dz ® dz a conformal metric on E. 

The energy of a map / : E — > N is written as 

1 r ^ q fi q fj / 

E{f) = - j 9ij -4^ ~ryi — ^ — P{z)dz A dz (since dx A dy 

E 

f dp dp r - 1 

£ 


= -dz A dz) 


( 8 . 2 . 11 ) 


Corollary 8.2.4 The energy of a map from a Riemann surface E into a 
Riemannian manifold is conformally invariant in the sense that it does not 
depend on the choice of a metric on E, but only on the Riemann surface 
structure. Also, if k : E\ — > Eq, is a bijective holomorphic or antiholomorphic 
map between Riemann surfaces then for any f : E 2 — > N (of class C 1 ) 

E(fok) = E(f). 


a 

Note: Even if the image is also a Riemann surface, the energy of / does 
depend on the image metric. 

Theorem 8.2.1 Let E be a Riemann surface, N a Riemannian manifold 
with metric (-,-) N , or in local coordinates. If f : E — > N is 

harmonic, then 

ip{z)dz 2 = <^, d p N dz 2 (8.2.12) 

is a holomorphic quadratic differential. (Here, we use the abbreviation 

dz 2 := dz (g> dz, 

and <p(z)dz 2 is a holomorphic quadratic differential, if ip(z) is a holomorphic 
function, dz 2 just expresses the transformation behavior. Thus 

ip(z)dz 2 

is a section ofT^E 0 Tf,E, with T£E := T* E 0 C.) 

Furthermore 

ip(z)dz 2 = 0 4==> / conformal. 
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Proof. In local coordinates 




and we have to show for a harmonic / 




w , tft \\®f l df\ _ 0 v j vj- : wj 

9ij(f ( z )) ^ Q z ) — 2 9ii + 9ij,k 


d 2 / 1 d/ J ' 

9ij dz&zlh 


df k dp dp 
dz dz dz 


„ d 2 p dp .df k dpdp 

~ 9ij ~dP&z ~dP + (mfc + ~&P~dP~dP 

„ dp / d 2 f i df k dfP 
9l3 Ph^d^z + H ldPlh^ 

= 0, if / is harmonic. 

Finally, <p(z)dz 2 = 0 is equivalent to the conformality of /, see (8.2.8). □ 

In intrinsic notation, the proof of Theorem 8.2.1 goes as follows 


d df df 

dz < dz ’ <9z 


5/ df. 


dp d df , 


K dz' dz >JV_2(v A dz* d^ N ~ 2(v dz 


^2 j?i Q fj Q jk Q Qj? 

= 2<( &ai + ** aFar’ap SI 1 " = °- since 7 is h “ ic - 


We also note from this computation 


t(/)=4Vjl^. 

s* <9z 


(8.2.13) 


In real notation, we have of course 

<p(z)dz 2 (8.2.14) 

= It - ^ 

The easiest example of a compact Riemann surface is S 2 = {(x\,x 2 ,x 3 ) £ 
]R 3 : x 2 + x% + x 2 = 1} with the following two coordinate charts: 


fi ■ s 2 \{(0, 0, 1)} -> C, fi{xi,x 2 ,x 3 ) = 


1 - x 3 


(x\ + ix 2 ) 
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h '■ S 2 \{(0,0, -1)} ->■ C, f 2 {x i,X 2 ,X 3 ) 


1 

1 + x 3 


(xi - ix 2 ). 


We compute 


1 

fi{xi,x 2 ,x 3 ) 


f 2 (x i,x 2 ,x 3 ) 


so that f 2 o /i 1 (z) = \ and the coordinate transformation f 2 of 1 1 : C\{0} 
C\{0} is holomorphic as required. 


Lemma 8.2.4 Every holomorphic quadratic differential on S 2 vanishes iden- 
tically. 


Proof. We put z = fi(x) and write a holomorphic quadratic differential in 
the chart /i as 

p(z)dz 2 , with p : C(= /i( 5 2 \{(0, 0)})) — > C holomorphic. 

Then with f 2 (x) = w = f for z ^ 0 

r) 2 1 1 

p(z)dz 2 = p(z(w)){^—) 2 dw 2 = <p(z(w)) — rdw 2 . 

aw u> 4 

Since we have a holomorphic quadratic differential on S 2 , this has to be 
bounded as w — ^ 0. We conclude that p is a holomorphic function on C with 

p(z) — > 0 as 2 — > oo, 

hence p = 0 by Liouville’s theorem. (One may also apply Lemma 8.3.7 below) 

□ 


Another Proof. In the preceding notations, for A £ C\{0}, 

z e- > Xz 

induces a holomorphic map h\ : S 2 — > S 2 , fixing (0, 0, 1) and (0, 0, —1). Since 
h\ also depends holomorphically on A € C\{0}, 

dh\(z) _ ~ 
dX |a=i 

represents a holomorphic vector field V(z) on S 2 . 

Now if q is a holomorphic quadratic differential and V\ . V 2 are holomorphic 
vector fields on a Riemann surface E 1 then 

q(V U V 2 ) 

is a holomorphic function on E. Thus 


rj{z) ■= p{z)dz 2 (V(z),V{z)) = p{z)z 2 
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represents a holomorphic function on the compact Riemann surface S 2 and 
therefore is constant (for example by Corollary 2.1.2 and Corollary 8.2.3 or by 
an easy application of the maximum principle), hence rj = 0, since ry( 0) = 0, 
hence ip = 0. □ 

Corollary 8.2.5 For any Riemannian manifold N, every harmonic map 

h:S 2 -+N 

is conformal, i.e. a parametric minimal surface. 

Proof. From Theorem 8.2.1 and Lemma 8.2.4. □ 

We look again at the family h\ = S 2 — > S 2 of holomorphic selfmaps of 
S 2 , given in the chart /i by 

2 A Z. 

We equip the image S 2 with any conformal metric and compute the energy 
E w.r.t. this metric. We observe for A G C\{0} 

E(h\) = const, yf 0. 

Namely, we write h\ = id o h x and apply Corollary 8.2.4 with / = id(= hi), 
k = h\, hence 

£(/ii) = E(h x ) for all A G C\{0}, 

and since h\ yf const, for A G C\{0}, this energy cannot vanish. Now if A — > 0, 
h\ converges pointwise on S 2 \{(0, 0, — 1)} to the constant map h 0 (z) = 0 
(again in the chart /i), and 

E(ho) = 0. 

We thus have found a sequence of holomorphic, hence harmonic (Corollary 
8.2.2) maps, hence critical points of E , i.e. 

DE{h x ) = 0 for all A G C\{0} 


with 


E(h\) = const, y^ 0 


with the property that this sequence converges for A — ■> 0 pointwise almost 
everywhere to a map ho with 


E(h 0 ) ± lim E(h x ). (8.2.15) 

We conclude 


Theorem 8.2.2 The energy functional for maps from S 2 to S 2 (the image 
equipped with any conformal metric) cannot satisfy any kind of Palais- Smale 
condition. □ 
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The statement is somewhat vague because we have not yet given a precise 
definition of the Palais-Smale condition in the present context. Any mean- 
ingful definition, however, should require that a sequence of critical points 
(/«)ne n of E contains a subsequence converging in some sense to be specified 
towards a map / with 

E(f) = lim E(f n ). 

n—> oo 

Definition 8.2.4 A Riemann surface E with (smooth) boundary dE is a 
differentiable manifold with boundary and charts with values in C and C+ := 
{z = x + iy £ C, y > 0}, resp., and holomorphic coordinate changes. 

O 

Again, in this case E = E\dE is a Riemann surface in the sense of 
Definition 8.2.1. Also, dE is a differentiable manifold of real dimension 1. 


Example. 


D := {z = x + iy £ C : \z\ < 1} 
with dD = {\z\ = 1}. 


Definition 8.2.5 A holomorphic quadratic differential q on a Riemann sur- 
face E with boundary dE is called real on dE if for all Zq £ dE and 
V1,V2 £ T Zo dE , i.e. vectors tangent to the boundary 

q(v i, v 2 ) £ R. 

Let z o £ dE, f : U —* C + a chart defined on a neighborhood of zq, z = 
x + iy £ C+. In this chart, we write a holomorphic quadratic differential as 

ip(z)(dx + idy) 2 = (u + iv) (dx 2 — dy 2 + 2idxdy) with u= R eip,v= Im tp 

( 8 . 2 . 16 ) 

= u{dx 2 — dy 2 ) — 2 vdxdy + i(v(dx 2 — dy 2 ) + 2 iudxdy). 

When applied to a vector tangent to dC + = {y = 0}, dy vanishes. Thus, the 
holomorphic quadratic differential is real on dE if 


v = Im ip = 0 


for all such boundary charts. 

Lemma 8.2.5 Any holomorphic quadratic differential on D which is real on 
dD vanishes identically. 

Proof. A holomorphic function h on an open subset fl of C+ which takes real 
values on 9C+ can be reflected as a holomorphic function to fl := {x + iy : 
a ' — iy £ 12} via h(x + iy) := h(x — iy). This is the Schwarz reflection principle. 
In the same manner, a holomorphic quadratic differential on an open subset 
of C+ which is real on 9C+ can be reflected across 3C+ . Thus, a holomorphic 
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quadratic differential on D which is real on dD can be reflected to a holo- 
morphic quadratic differential on S 2 . Namely, since /i(<S 2 \{(0, 0, 1)}) = C 
in our above notation, we may consider D as a subset of S' 2 , and we reflect 
ip(z)dz 2 across dD as 


(p(w)dw 2 = tp(z)dz 2 for w = — 

z 

= p{~)\dw 2 . 

w 


The result now follows from Lemma 8.2.4. 


□ 


Theorem 8.2.3 Let h : D — > N be a harmonic map into a Riemannian 
manifold with 

h\ dD = const. 

Then 


h = const. 


Proof. We denote the metric of N by (fjjk)- In local coordinates defined on 
an open subset of C+, the holomorphic quadratic differential associated to h 
(Theorem 8.2.1) is 


l2 „.„.dh?dh k dhP dh k , 

Tdz =g jk (h(z))(— — - -—-2 i^-^r-){dx + idy) 


dx dx dy dy 


dx dy 


Since h\g D = const., = 0 on SC + . Thus 


dW dh k 

Im ip = 2g jk — — = 0 on dC+, 


and Lpdz 2 is real on the boundary. Lemma 8.2.5 implies <pdz 2 = 0. Therefore 
h is conformal. Since = 0 on 9C+, then also |) = 0 on 3C+. Since h 

r\ 2 i 

is harmonic and = 0 on <9C+, the harmonic map equation gives also 
= 0 on dC + . Iteratively , all derivatives of h vanish on dC + . Hence we 
can reflect h smoothly as a harmonic and conformal map across 9C+ via 
h(z) = h(z) for z = x + iy with y < 0. This means that we can reflect h to a 
harmonic and conformal map 


h : S 2 —> N 


mapping dD = {|^r| = 1} (considering D as a subset of S 2 as above) onto a 
single point. 

In the sequel, we shall use the abbreviation 


u z := 




du .du 
dx dy 


) 5 U z 


1 du . du 
Tfrc +l Py 


), 
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even for functions u : C — > i.e. with real values, with componentwise 

differentiation. Thus, for every zq, 

u z (z o) € C d . 


We now need the Hartman-Wintner-Lemma 


Lemma 8.2.6 Suppose fl is a neighborhood of 0 in C, u £ C 2 (f2 , R d ) satisfies 

\u Z z\ < K\u z \ (8.2.17) 


for some constant I\ in fi. 

If 

lim u(z)z n+1 = 0 (assume the limit exists) (8.2.18) 

z—>0 

for some n £ N, then 

lim u z (z)z~ n 

z— >0 

exists. If (8.2.18) holds for all neN, then 

u = 0. (8.2.19) 


Proof. For a compact subregion B of 12 with smooth boundary and g £ 
C 1 (B,C), we have the integration by parts formula 



{u z g z + u zz g) 


dz A dz 
2 i ’ 


where it is the exterior normal of B. 
We assume now 


( 8 . 2 . 20 ) 


We choose 

with 

and 

Then 

(8.2.20) yields 


lim u z z l k = 0 for some k £ N. 

z—>0 

B ■= {z £ C : £ < \z\ < R, \z — w\ > e} 

0 < 3 e < R < min(dist (0, 812), ^) 
w £ f2,2e <\w\ < R — e 

g{z) = z- k {z-w)- 1 . 

g z = 0 in B. 


( 8 . 2 . 21 ) 
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( j f u z z k (z — w ) l \dz\ — J) u z z k (z~w) l \dz\ 

J\z\ = R j \z\=£ 

— <j> u z z~ k (z — w)~ l \dz\ 

J \z — w\=£ 


' | Z — w\=£ 

_u, . *dzAdz 

u zz z (z - w) — — — 
? ll 


( 8 . 2 . 22 ) 


We now let e — > 0. Because of (8.2.21), the second integral on the left hand 
side of (8.2.22) then tends to 0. The third one tends to 

27 Tll z (w)w~ k 

by Cauchy’s integral formula. Consequently for 0 < |w| < R 


27 tu z (w)w k = (p u z z k (z — w ) l \dz\ — f u zz z k (z — w ) 
J\z\=R J\z\<R 


u, . ,dz Adz 


and (8.2.17) implies for 0 < |u;| < R 


2tt\u z {w)w fc | < (f | u z z k (z — w ) 1 ||d,z| 
J\z\=R 


T . [ i I, ,_i dz A dz 

+ R / |u*||z| \z-w\ — — — . 

J\z\<R Zl 


(8.2.23) 


Two auxiliary points: 


. _ n dw A dw 

I \z-w\ — — — < / \z-w\ - — — — S 47m 

J\w\<R zl J\z-w\<2R Zl 

~ — = — (— + )• 

Z — W W — Zo Z — Zo Z — W W — Zo 

We then multiply (8.2.23) by | w — z 0 | _1 (kol < R) and integrate w.r.t. w : 


i dw A dw , „ 
" — < 4tt1? 


_ 1 dw A dw 


2 7T / \u z \\w- k \\w - z 0 \~ l — c 

J\w\<R i 

<87 tR (J) \u z z~ k {z — z 0 )~ 1 \\dz\ 

J\z\=R 


+ 8irRK / \u z \\z\ fc |. 2-zo| 1 

J\z\ <R 


(8.2.24) 


dz A dz 
2 i 


Hence, renaming some of the variables 

(1 — ARK) f \u z \\z\- k \z - w\~ x 
J\z\ <R 


dz A dz 


(8.2.25) 


<4 R<f \u z z~ k (z — w) _1 ||d^| . 
J\z\=R 
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The right hand side of (8.2.25) remains bounded as w — > 0 and consequently 
so does the left hand side. Then the right hand side of (8.2.23) remains 
bounded as w — > 0, and consequently also the left hand side. Therefore 

lim u z (z)z~ k (8.2.26) 

z — »0 


exists. 

If k < n, this limit then has to vanish because of (8.2.18), and hence 
(8.2.21) holds for k + 1 instead of k. 

The first assertion now follows by induction on k : 

It is trivial for n = 0. For n > 1, (8.2.18) implies (8.2.21) for k = 1. By 
induction, we get (8.2.21) for k = n, and hence the limit in (8.2.26) exists 
which is the first assertion of the lemma. 

For the second assertion, k = n — 1 and w — + 0 in (8.2.25) gives 

(1-4 RK) [ \ Uz \\ z \-^± (8.2.27) 

J\z\<R 

<AR(f \u z \\z\~ n \dz\ 

J\z\=R 


for all n. 

If u ^ 0, there exists z 0 with |z 0 | < R and 


\u z (z 0 )\ =c^0. 

Then the left hand side of (8.2.27) would grow in u at least like c\zo\~ n , the 
right hand side at most like d R~ n , with d = AR sup^i =i j \u z \. Since |^o I < R, 
(8.2.27) then could not hold for all n. This contradiction proves the second 
assertion. □ 


We can now easily conclude the 

Proof of Theorem 8.2.3: We may assume of course that in local coordinates 

h{dD) = 0. 

In the same local coordinates as in the beginning of the proof, we have noted 
above that all derivatives of h vanish on 3C+. Thus, if e.g. 0 is in the image 
of our coordinate chart, 

lim h{z)z~ n = 0 for all n G N. 

z—>0 

Since h is harmonic 

\h zz \ < c 0 \h z \\h z \ 

<K\h z \, 

in a neighborhood of 0 since h is smooth. 

Lemma 8.2.6 then yields h = 0(= h(dD)). □ 
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More generally, Lemma 8.2.6 implies 

Corollary 8.2.6 Let £ be a Riemann surface, N a Riemannian manifold of 
dimension d, h : £ — > N harmonic. 

Then for each zq £ £ there exists m £ N with the property that in any 
local coordinates around h(zo), there exists a £ C d with 

h z {z ) = a(z - z 0 ) m + 0(|* - z 0 \ m ) (8.2.28) 


for z near z o- 

If h z (zo) = 0,m > 1. In particular, the zeroes of h z are isolated, unless h 
is constant. 

If h is conformal, i. e. 

g jk hP z h k z = 0, 

then 

gjk(Hz 0 ))a J a k = 0 . 

Proof. We apply Lemma 8.2.6 with u = h — h(zo). As above, since h is 
harmonic and smooth 

\h zz \ < co\h z \\h z \ 

<K\h z \ , 

so that (8.2.17) holds. All claims follow easily. □ 

We want to discuss a consequence of Theorem 8.2.3. 

We look at (continuous) maps 

f-.D^S 2 


with 


f{dD) a point, say the north pole. 


It is an elementary topological result that the homotopy classes of such maps 
can be parametrized by their degree, namely up to a constant factor, with 
lo := dvol(S 2 ), the volume form of S 2 for some Riemannian metric, by 


/ /*(w) (in case / is smooth). 

Jd 

That f D f* (w), for smooth /, depends only on the homotopy class of / is a 
consequence of Stokes’ theorem. Also, one easily constructs / : D — > S' 2 for 
which this invariant is not zero. Consequently, not every map f : D —> S 2 
with f(dD ) a point is homotopic to a constant map. 


Corollary 8.2.7 There exist smooth maps f : D — > S' 2 mapping dD onto a 
point which are not homotopic to a harmonic map. 
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Proof. By Theorem 8.2.3, any such harmonic map is constant, while not every 
smooth map as in the statement is homotopic to a constant map. □ 


Perspectives. The method of holomorphic quadratic differentials associated to 
two-dimensional geometric variational problems was introduced by H. Hopf. He 
considered the case of closed surfaces of constant mean curvature in R 3 (cf. exercise 
5)). 

The applicability of the Hartman-Wintner Lemma to two-dimensional geometric 
variational problems was first discovered by E. Heinz. 

The special features of 2-dimensional harmonic maps, tying them to conformal 
geometry, extend to the Dirac- harmonic maps introduced in [47, 46]. 


8.3 The Existence of Harmonic Maps in Two 
Dimensions 

We start with some simple topological preliminaries. 

Let N be a manifold. 

Definition 8.3.1 7T2(.ZV) = 0 means that every continuous map 

p : S 2 -> N 

is homotopic to a constant map. 

Lemma 8.3.1 ir 2 (N) = 0 

4=> Any ho, hi £ C°(D,N) with h 0iaD = hi laD are homotopic. 

Proof. 

” 4= ” Take 77 : D — > S 2 bijective on D with rj(dD) = po- For ip : S 2 — > N 
define ho = ip ° f], hi = ipfjpo). 

” =4- ” Given ho, hi we define <p : S 2 — > N by 

ip(p) := h 0 (fi(p)) if|/i(p)|<l 
p{p) := hi(f 2 (p)) if |/ 2 (p)| < 1 

where fi, f 2 are the coordinate charts of §1.1. 
ip is continuous since ho l{]z]=1} = hi l{{z[ai y 

If 7 t 2 (N) = 0, there exists a continuous map 

L : S 2 x [0, 1] N 


with 
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L\S 2 x{0} = <P 
L\S 2 x{i} = const. 

We now define a homotopy 


H : D x / -> N 


by 

H(z,t) := L(ff 1 (2z),2t) for \z\ < ^,0 < t < ^ 

H(z,t) := L(f 2 1 (‘2z), 2(1 - t)) for \z\ < ^ < t < 1 

H(z,t) := L(/r 1 ( |^| ),4£(1 - |z|)) for ^ < |z| < 1, 0 < t < X - 
#0,t) := -b(/ 2 " 1 (|7|)j4(l — t)(l — |z|)) for ^ < |z| < 1, ^ < t < 1. 

Then ff is continuous, ff|{| 2 |=i} x {t} = = ^i|{|*i=i} f° r L and 

i?|Dx{o} is homotopic to ho, IL|dx{i} to hi. □ 


Remark. While the proof is formal, the claim of Lemma 8.3.1 should be 
geometrically obvious. 

The first aim of this section is the proof of 

Theorem 8.3.1 Let S be a compact Riemann surface, N a compact Rie- 
mannian manifold with 

n 2 (N) = 0. 

Then any smooth <p : S — > N is homotopic to a harmonic map f : E — > N. f 
can he constructed as a map which minimizes energy in its homotopy class. 

We need to establish some auxiliary results before we can start the proof 
of Theorem 8.3.1. 

We say that a continuous map 

h: M -> N 

between differentiable manifolds is of Sobolev class if it is of this class 
w.r.t. any coordinate charts on M and N. If M is compact, we can then also 
define Sobolev classes H k ’ p for continuous maps. For a better discussion of 
Sobolev spaces, see §8.4 below. 

Lemma 8.3.2 Let N be a Riemannian manifold, B 0 C Bi C N, B 0 ,Bi 
closed. Let n : B\ — > Bq he of class C 1 , 



422 8. Harmonic Maps 


k\b 0 = id\B 0 ( 8 - 31 ) 

and 

||r>7r(w)|| < ||w|| for every x £ B]\B 0 ,v £ T x N,v ± 0. (8.3.2) 

Let M be a Riemannian manifold with boundary dM , and let 

h £ C° (1 Bi) 

h(dM) c B 0 (8.3.3) 


be energy minimizing in the class of all maps from M into B\ with the same 
boundary values as h. 

Then 

h(M) C B 0 . (8.3.4) 


Proof. Let us assume that 

£2 := h-^B^Bo) ± 0 . 

Since h is continuous, £2 is open, and since h(dM) C B 0 , h cannot be constant 
on £2. 

Thus 

E{h\o) > 0 . 

But then by (8.3.2), since n o h £ H 1,2 as 7 r G C 1 , 

E{ 7r oh) < E(h ) 

contradicting the minimizing property of h (Note that 

(tt o h)\ dM = h\ dM by (8.3.1) and (8.3.3).) 

Therefore £2 is empty. □ 

Lemma 8.3.3 Let N be a Riemannian manifold, Bo C B\ C N, Bq,Bi 
compact. Suppose that every point in B±\Bo can be joined inside B\\Bq to 
dBo by a unique geodesic normal to 8Bq. Also assume that for any two such 
geodesics 7 i (t), 72(4), parametrized by arc length (t > 0) with 7*(0) £ dB 0 , 
i = 1,2, we have 


d(7i(*)>72(*)) > ^(7 i( 0)>72(0)) for t > 0. (8.3.5) 

Then the conclusion of Lemma 8.3.2 holds. 

Proof. We define 7r : Bi — > B 0 as the identity on B 0 and the projection 
along normal geodesics onto dB 0 on Bi\B 0 , i.e. if 7 (t),t > 0, is a geodesic 
normal to dB 0 inside Bi\B 0 , with 7(0) £ dB 0 , then 77(7(4)) = 7(0). This map 
satisfies all the hypotheses of Lemma 8.3.2, except that it is only Lipschitz, 
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but not C 1 . It is not difficult, however, to approximate 7 r by maps of class 
C 1 satisfying the same hypothesis, and the result then easily follows from 

Lemma 8.3.2. □ 

Lemma 8.3.4 Let N be a Riemannian manifold, p £ N, i(p ) the injectivity 
radius of p, and suppose that the sectional curvature of N is bounded from 
above by n, and let 

0 < P<\ min(i(p), -^=). (8.3.6) 

3 2y/K 

Let M be a Riemannian manifold with boundary dM, and let h £ C° C \ 1 
H 1,2 (M,N) with 

h(dM) C B(jp, p) = {q£ N : d(p , q) < p}. (8.3.7) 

If h minimizes the energy among all maps with the same boundary values, 
then 

h{M)cB{p,p). (8.3.8) 


Proof. By (8.3.6), we can introduce geodesic polar coordinates (r,ip) on 
B(p,3p) (0 < r < 3 p). We now define a map 7r : N — > B(p,p) 1 given in 
these coordinates by 

n(r,ip) = ( r,ip ) if r<p 

= (\p-\r,ip) if p < r <3p 
n(q) =p if q G N\B(p, 3p) . 


Thus, 7T maps concentric spheres of radius < 3 p onto concentric spheres 
of possibly smaller radius. It is clear that on B(p,3p)\B(p, p), tt is length 
decreasing in the r-direction. In order to see that tt is also length decreasing 
in the (^-directions, let y(s) be a curve given in our coordinates by (r, <p(s)), 
i.e. a curve in the distance sphere dB(p , r). For each fixed s, c s (t) := (t, <£>(s)) 
is a radial geodesic with c s (0) = p,c s (r) = y(s). Thus 

Js (t) ■■= ^ C s(t) 

is a Jacobi field, and 

7(s) = J a (r),0 = J a (0) (8.3.9) 

and 

Dtt( , j(s)) = J s (r'), where ( r',<p ) = n(r,ip), (8.3.10) 

i.e. r' < p < r < 3p (8.3.11) 


The Rauch comparison theorem (Theorem 4.5.1) implies that (assume y(s) 

0) 


|T a (r)| sin (y^r) 
•/«(/•') “ sin(v / Kr / ) 


• / o' 

since r < r < 3p < - _ . 

2v k 


(8.3.12) 
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Consequently by (8.3.9), (8.3.10), (8.3.12) 

|Dtt( 7 (s))| < | 7 (s)|, if y(s)) ^ 0. (8.3.13) 

Therefore, 7 r is also length decreasing in the (^-directions. 

7 T is not C 1 , but only Lipschitz. It can, however, be approximated by 
C-maps with the same length decreasing properties, and Lemma 8.3.2 then 
again gives the result. □ 

We shall also need the Courant-Lebesgue-Lemma. 

Lemma 8.3.5 Let N be a Riemannian manifold with distance function #, •), 

u G H l ' 2 {D,N) 

with 

E{u) < K. (8.3.14) 

Then 

Vcco G D 1 5 G (0, 1) 3 p G (c>, Vd) V Xi,X 2 G D with \xi — x$\ = p (i = 1, 2) : 

d(u(x 1 ),u(x 2 )) < |. 87r ^i • (8.3.15) 

(log j) 2 


Proof. We first recall the following property of an H 1,2 function u : 

For almost all r > 0,u^g B t Xo ^ is absolutely continuous. (See Lemma A. 1.2.) 
Then for any such r and x±, x 2 G D with \xi — xq\ — r,i = 1,2, we have 

d(u(x i),u(x 2 ))< [ \\—^r-^\\d<p (8.3.16) 

J o <# 

in polar coordinates (r,ip) with center Xq, w.l.o.g. B(xo,r) C D\ otherwise, 
the integration in (8.3.16) is only over those values of ip which correspond to 
dB(x 0 ,r)nD. 

By Holder’s inequality 

II ||# < (27T) * || II 2 #) 1 . (8.3.17) 

The energy of u on B(xo,r) is 

Consequently, there exists p G (<5, y/S) with 


r f# < 2 E ^o, P )) < = 

Jo dp fVs I dr — 5 log log | 

J 8 r 


(8.3.18) 
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The claim follows from (8.3.16), (8.3.17), (8.3.18). □ 

As an intermediate result for the proof of Theorem 8.3.1, we now show 

Theorem 8.3.2 Let N be a complete Riemannian manifold with sectional 
curvature < k and injectivity radius io > 0 ,p £ N. Let 

0 <r (8.3.19) 

Suppose g : dD — > B(p,r) C N is continuous and admits an extension 
g : D — > B(p,r) of finite energy. 

Then there exists a harmonic map 

h : D — > B(p, r) C N 


with 

h\ dD = 9, 

and h minimizes energy among all such maps. 

The modulus of continuity ofh is controlled by r, k, Efg), and the modulus 
of continuity of g, i.e. given e > 0, there exists 6 > 0 depending on r,K,g 
such that \xi — xf\ < 6 implies d{h{xf),h(x 2 )) < e. Finally, for any a > 0, 
the modulus of continuity ofh on {z :\z\<l — a} is controlled by cr,r,n, and 

E (V)- 

Proof. We choose r' with 

r < r' < min(^, -^=). (8.3.20) 

Using the Rauch comparison theorem as in the proof of Lemma 8.3.4, one 
sees that 

7T : B(p,r') -> B(p,r), 

with 7T| B(p,r) = id, and projecting B(p,r')\B(p,r) onto dB(p,r) along radial 
geodesics satisfies the assumptions of Lemma 8.3.3. 

As a first and preliminary application we show that any two points pi,P 2 G 
B(p , r) can be joined inside B(p, r) (and not just in N) by a unique shortest 
geodesic. For this purpose, we minimize 

E{c) 


in 

{c : [0,1] -> B(p,r') : c(0) =pi,c(l) =p 2 }- 


As in 1.4, the infimum is realized by some curve Co with image in B{p,r'). 
Because of the distance decreasing properties of 7 r, Lemma 8.3.3 (with 
B 0 = B(p,r),B i = ( B{p,r ')) implies that the image of Co is actually con- 
tained in the smaller ball B(p, r). Therefore, we may perform arbitrarily small 
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variations of Co without leaving B{jp,r'). Therefore, Cq is a critical point of 
E. hence geodesic by Lemma 7.2.1. Since pi,P 2 G B(p, r), they can be joined 
inside B(p,r) by a curve of length < 2 r < to- Therefore, Cq is the unique 
shortest geodesic between pi and p 2 by the definition of the injectivity radius 
'to- This proves the claim about geodesic arcs. We note that Co is free from 
conjugate points, again by Rauch’s comparison theorem (Theorem 4.5.1). 

In order to find the harmonic map, we now minimize the energy in 

V := {r£ H 1,2 (D, B(p, r')), v — g € Hq’ 2 (D, B(p, r'))} 


(the latter is the weak formulation of the boundary condition). Since B{p, r') 
is covered by a single coordinate system, namely normal coordinates, the 
iL 1 - 2 -property can be defined with the help of these coordinates. 

A minimizing sequence has a subsequence converging in L 2 by Theorem 
A. 1.8. We shall see below (Theorem 8.4.2), in order not to interrupt the 
present reasoning, that E is lower semicontinuous w.r.t. to L 2 convergence. 
Therefore, the limit h minimizes energy in V. By Lemma 8.3.3 again, h{D) is 
contained in the smaller ball B(p,r), hence a critical point of E because we 
may again perform arbitrarily small variations of h without leaving the class 
V. 

We now want to show that h is continuous and control its modulus of 
continuity. 

Let q G B(p,r),vi,v 2 G T q N with H^H = l,i = 1,2, 


Ci(t ) = exp q (tvi). 


By Rauch’s comparison theorem (Theorem 4.5.1) again, as in the proof of 
Lemma 8.3.4, 

d(ci(f),c 2 (f)) > d(ci(e),c 2 (e)) (8.3.21) 


for 


With 


for any 0 < e < £q, 


7 r 

£ < t < —j= — e . 
V K 


7 r 

£o := ~j= - 2 r, 
V K 


B 0 := B(q, e) n B(p, r), B x := B(p , r) 

satisfy the assumptions of Lemma 8.3.3, as any geodesic 

c(t) := exp g tv, ||v|| = 1 (v G T q N, q G B(p, r)) 

leaves B(p,r) for t > 2r (i.e. c(t) G B{p,r) t < 2r; this is a consequence 
of (8.3.19) and the resulting uniqueness of geodesics in B(p,r)). 

We now apply the Courant-Lebesgue Lemma 8.3.5. Since h is energy 
minimizing, 
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E{h) < E(g). 

For 0 < s < so, we compute <5 e (0,1) with 


8 t r_E(g) 

log? 


) 5 < 


(8.3.22) 


For any Xq £ D, by Lemma 8.3.5 there exists p, S < p < VS, with the 
property that for any X\,X 2 £ D with \xi — x$\ = p (i = 1, 2), 

d(h(xi), h{x 2 )) < e, (8.3.23) 


hence 

h(dB(xo, p) l~l D) C B(q,s) for some q £ N. (8.3.24) 

Since g is continuous, there also exists S' > 0 with 


d(g(yi),g(y 2 ))<e (8.3.25) 

whenever y -\ , y -2 G dD satisfy \y\ — 2 / 2 1 < S’ . 

We now require in addition to (8.3.22) that also 

Vs<s'. 


Since h\go = <?, with p as above we then have 

h(d(B(xo, p) D)) C B(q,e) for some q £ N. (8.3.26) 

(■ d{B(x 0 , p)C\D) = (dB(x 0 , p)C\D) U {dD n B(x 0 , p)). 

Lemma 8.3.3 then implies 

h(B(x 0 , p) D D) c B(q, e). (8.3.27) 

Likewise, |cco| + p < 1, then d{B{x 0 , p) D D) = dB(xo,p) fl D, and so in this 
case, we do not need g to control h on d(B(x 0 , p)C\D). 

In particular 

h{B{x 0 , S)nD) c B{q 7 e) (8.3.28) 

for any xq £ D and some q £ N (depending, of course, on xo ). (8.3.28) is the 
desired estimate of the modulus of continuity. The proof of smoothness of h 
is postponed until after the proof of Theorem 8.3.2 - see Theorem 8.5.1. □ 


Remark. We actually shall only need the weaker result that there exists 
r 0 > 0 with the property that for any r £ (0,?’o), the conclusion of Theorem 
8.3.2 holds. As an exercise, the reader should simplify the preceding proof in 
order to show this weaker statement. On the other hand, the injectivity radius 
io in (8.3.19) can easily be replaced by i o(r) := min{i(g) : q £ B{p, r)}, where 
i{q ) is the injectivity radius of q 1 without affecting the validity of the above 
proof. This remark is interesting for complete, but non compact manifolds 
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N. In this case, one may have = 0, but one always has io(r) > 0 for any 
r > 0 as N is complete. 

Finally, D may be replaced in Theorem 8.3.2 by any compact Riemann 
surface E with boundary BE, with only trivial modifications of the proof. 


Proof of Theorem 8.3.1: We put 

[ip] := {v £ C°n H 1,2 (E, N) : v is homotopic to ip). 


We choose 

P'=\ min(* 0 (iV), -|s=), (8.3.29) 

3 2\/k, 

where io(iV) is the injectivity radius of N, and k > 0 is an upper bound for 
the sectional curvature of N. We choose So < 1 to satisfy 


( 8ttE(<p) i p 
1 2 ' 


(8.3.30) 


For every 6 £ (0, ho), there exists a finite number of points Xi £ E, i = 
1 ,... ,m = rn{5), for which the disks B(xi, |) cover E. Here, we may define 
the disks B(xi, |) w.r.t. any conformal metric on E. We may also arrange 
things so that around each Xi, there exists a coordinate chart fi with image 
containing 

{z G C : | f(Xi) -z\<l} 


and put 

B(xi,S) := {z £ C : | f(xi) — z\ < <5}. 

We let (u„) ne f ! j be an energy minimizing sequence in [<p]. By definition of [ip], 
all u n then are continuous. Also, w.l.o.g., 


E(u n ) < E{ip) for all n. (8.3.31) 

Lemma 8.3.5 implies, recalling (8.3.30), that for every n £ N, there exists 
r n ,i G (A, \/$) and p n p £ N with 

u n (dB(xi,r n> i)) C B(p nA ,p). (8.3.32) 

On the other hand, if u n {dB{x,r)) C B{p,p) for some x £ E,r > 0,p £ N, 
then Theorem 8.3.2 (replacing D by B(x, r)) yields a solution of the Dirichlet 
problem 

h : B(x,r) — > B{p, p) harmonic and energy minimizing with 

(8.3.33) 

h\dB(x,r) ^ n\dB(x,r ) • 

We replace u n on B(x\,r n p) by the solution of the Dirichlet problem (8.3.33) 
for x = Xi,r = r n i. Outside B(xi,r n p), we leave u n unaltered. 

We denote the new map by u\. Since 7 r 2 (iV) = 0, by Lemma 8.3.1, vf, is 
homotopic to u n , hence to ip. Thus 
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e M- 

After selection of a subsequence, (r n> i) rae N converges to some rq G [A, \/<5] . 
By the interior modulus of continuity estimate of Theorem 8.3.2, the maps 
(u\) are uniformly continuous on B(x\,6 — rf) for any rj G (0,(5). Moreover, 
by Lemma 8.3.4, u\ minimizes the energy not only among maps into B(p , p), 
but among all maps into N with the same boundary values. 

Thus 

E{ui) < E(u n ). (8.3.34) 

Repeating the above argument, we find radii r n> 2 G (5, VS) with 
u\{dB{x2,r nt 2)) C B(p„ j2 ,p) 

for points p „ j2 G N. We replace u * on B(x 2 , r„ )2 ) by the solution of the 
Dirichlet problem (8.3.33) for x = x 2 ,r = ry ij2 . Again by selecting a subse- 
quence, (r„ j2 ) ne N converges to some r 2 G [(5, VS\. The new maps u ^ are again 
homotopic to ip, i.e. 

because 7 r 2 (7V) = 0. 

Since the maps u l n are equicontinuous on B(x\,5— ^) whenever 0 < 77 < (5, 
the boundary values for our second replacement are equicontinuous on 

dB(x2,r n ,2) r\ B(x!,6 - |). 

Therefore, using the estimates of the modulus of continuity in the proof of 
Theorem 8.3.2, the maps u \ are equicontinuous on B(x\, 6 — 77 ) UB(x 2 , 6 — rj) 
for any with 0 < 77 < <5. 

By Lemma 8.3.4 and (8.3.34) 

E(u 2 n ) < E{u\) < E{u n ) (8.3.35) 


as before. 

We repeat the replacement argument on disks centered at X3, , x m . 

We obtain a sequence v n := u™ G [p\ with 

E(v n ) < E(u n ) < E(ip) (8.3.36) 

which is equicontinuous on every disk B(xi , |), i = 1 ,...,m, hence on E 
because these disks cover E. 

After selection of a subsequence, (f n )nGN converges uniformly to some 
map u which then also is homotopic to p. (u ra ) n gf!j then also converges in L 2 
to u. 

By Theorem 8.4.2 we have the lower semicontinuity 
E(u) < liminf E{y n ) 

n —> 00 


(8.3.37) 
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Since u £ \<p\ and (u n ), hence also (v n ) by (8.3.36) was a minimizing sequence 
for the energy in [(/?], (8.3.37) implies that u minimizes energy in [ip\. In 
particular, u is energy minimizing when restricted to small balls. Either from 
this observation and Lemma 8.3.4 and Theorem 8.3.2 or alternatively directly 
from the construction of u, the modulus of continuity of u is controlled by 
the geometry of N , more precisely by io(N) and k. and by E{ip). Smoothness 
of u follows from Theorem 8.5.1. □ 

With the same argument, one also shows: 

Theorem 8.3.3 Let £ be a compact Riemann surface with boundary d£, 
N a compact Riemannian manifold with tt 2 (N) = 0, tp £ C° fl H 1,2 (£, N). 
Then there exists a harmonic map 

u : £ —> N 


homotopic to ip with 


U \8S — 


and u can be chosen to minimize energy among all such maps. 


□ 


Remark. If one does not assume tt 2 (N) = 0, one still obtains a harmonic 
map u : £ — > N with u\qz = p\gs by our reasoning. In that case, however, 
u need not be homotopic to ip any more, u can be chosen to minimize the 
energy among all maps with boundary values given by p. 


In the sequel, we shall need the following covering lemma: 


Lemma 8.3.6 For any compact Riemannian manifold M , there exists A £ N 
with the following property: whenever we have points Xi,...,x m € M and 
p > 0 with 

m 

X c |J B( Xi ,p) 

i = 1 

and 

x% 4- B(xj,p) foriy^j, 

then {1, . . . , m} is the disjoint union of A sets so that for all £ £ 

{1, • • • ,A} and h,i 2 £ h,i i ^ i 2 , 

B(x il , 2 p) n B(x i2 , 2 p) = 0. 


Proof. We construct I\ : We first put x\ := X\ and iteratively seek points 
Xj £ {xi, . . . x m } with 

4 p < d{x] j ,x 1 i ) for all i < j, 
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until no such point can be found anymore. I\ is the set of points selected so 
far. If Xk h, there exists Xj £ I\ with 

d(x k ,Xj) < 4 p. 

We construct It iteratively for l > 2 : We select any Xk i- uL\h, put 
x{ := Xk and iteratively seek points Xj £ {aq, . . . , a: m }\ Ua=i h with 

4 p < d(Xj,xf) for all i < j 

until no such point can be found anymore. 

If Xk It, then for each A < £ , we can find some £ I\ with 

d{xk,Xj W ) < 4 p. 

All these points are distinct, and their mutual distance is bounded from 
below by p by our assumptions. Therefore, there exists some Aq £ N such 
that there exists at most A 0 points satisfying the preceding inequality. 
The reader should by now have acquired enough familiarity with the local 
geometry of Riemannian manifolds to verify the existence of such a Aq with 
the required properties. The claim follows with A := Aq + 1. □ 

Remark. It is easy to see that one may always construct coverings satisfying 
the assumption Xi ^ B(xj,p) for i ^ j. 

We now come to the important phenomenon of splitting off of minimal 
2-spheres. Before giving a general theorem below, we first want to isolate the 
phenomenon in a simpler situation: 


Theorem 8.3.4 Let S be a compact Riemann surface, N a compact Rie- 
mannian manifold 

u n : £ — > N 

a sequence of harmonic maps with 

E(u n ) < K for some constant K 

Then either the maps u n are equicontinuous, and hence a subsequence con- 
verges uniformly to a harmonic map u : E — > N, or there exists a nonconstant 
conformal harmonic map 

v : S 2 —> N, 

i.e. a (parametric) minimal 2-sphere in N. 


Proof. Let 


A n := sup \\du n (z)\\. 
ze£ 


We distinguish two cases 
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1) sup A n < 00 

raSN 

Then (u n ) n gN is equicontinuous, because the derivatives are uniformly 
bounded. A priori estimates (see § 8.5) imply that also higher derivatives 
of (u n ) are equibounded. By the Arzela-Ascoli theorem, a subsequence con- 
verges uniformly, and by these regularity results the limit is also harmonic. 
Alternatively, the limit is continuous and weakly harmonic, hence smooth 
and harmonic by Theorem 8.5.1. 

2) sup A n = oo 

After selection of a subsequence, A„ tends monotonically to oo, and a 
sequence (Zn) n eN C £ with 

\\du n (z n )\\ = sup ||du n (z)|| (= A n ) 

has a limit point zq- 

We choose suitable local coordinates for which 


{z:\z- z 0 1 < 2} 


is contained in a coordinate chart. All local expressions will be evaluated in 
this chart. We put 

D n := {w£ C : \w\ < A„} 


and define 


D n 


N 


by 

By definition of A n 


v n (w) := u n (z 0 + — ). 

^ n 

sup ||dv n ( , u))|| = 1. 

w£D n 


By conformal invariance of E 


E(v n ) < K. 


As n — > oo, D n exhausts all of C. By regularity results for harmonic maps 
(see § 8.5) after selection of a subsequence, (f n ) n gN converges uniformly on 
compact subsets of C to a harmonic map 


v : C 


N. 


Actually, the convergence takes place even in C , by a priori estimates for 
harmonic maps, see § 8.5 and therefore 

ll<MO)|| = 1, 


and v is not constant. Also, E(v) < K. 
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The holomorphic quadratic differential defined by v, 

9ij(v(z))v l z v 3 z dz 2 

(( gij ) being the metric of N in local coordinates) therefore yields a holomor- 
phic function 

i>{z) = 9ij{v{z))vlv{ 

of class L 1 , since 

J \ip\ < E(v). 

c 

By a variant of Liouville’s theorem, see Lemma 8.3.7 below, 

ip = 0 

and it follows that v is conformal (see 8.2). It remains to show that v extends 
as a harmonic and conformal map 

v : S 2 N 

where we consider S 2 as CU{oo}. Thus, one has to show that oo is a removable 
singularity. In 8.4, it will be shown more generally that conformal harmonic 
maps of finite energy on a Riemann surface cannot have isolated singularities. 

□ 

Theorem 8.3.5 Let £ be a compact Riemann surface, possibly with boundary 
d£, N a compact Riemannian manifold, (p G C° D H 1,2 (£, N). Then there 
exists a harmonic map 

u: £->N 

homotopic to ip, with u\qs = <p\d£ in case d£ ^ 0, or there exists a nontrivial 
conformal harmonic map 

v:S 2 -*N. 

i.e. a (parametric) minimal 2-sphere in N. 

Proof. We only treat the case d£ = 0. The case d£ ^ 0 is handled with easy 
modifications of the argument for d£ = 0. 

We let 

P : = l min(i(7V), -^=), (8.3.38) 

where i(N) is the injectivity radius of N, and k > 0 is an upper curvature 
bound. 

We choose a conformal metric on £. All distances on £ will be computed 
w.r.t. this metric. 

We let 

ro := sup{I? >0: Vx G £3p G N : ip(B(x,2R)) C B(p, 3 -/1 p)}, (8.3.39) 
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where A is the integer of Lemma 8.3.6 for M = £. 

According to Lemma 8.3.6, there exist finite sets I\, . . . , Ia and points 
Xi £ £ with 

A 

£=U U B (xi,r 0 ) (8.3.40) 

1=1 iele 

and 

B(xi i , 2 r 0 ) n B(x h , 2 r 0 ) = 0 (8.3.41) 

whenever i i , *2 € h,ii ^ * 2 , for some t. 

We then replace p on every disk B(xi,2ro ) for i £ Ii by the solution of the 
Dirichlet problem (8.3.33) for x = Xi,r = 2rg. This is possible by Theorem 
8.3.1. Since the disks B(xi,2r 0 ) for i £ Ii are disjoint by (8.3.41), we can 
carry out these replacements simultaneously. We obtain a map 

ul'.Z^N 

with 

E(ul) < E(ip) (8.3.42) 

as in the proof of Theorem 8.3.1. 

Since 

ul{B{x u 2 r 0 )) C B( Pi , 3 ~ A p) (8.3.43) 

for every i £ I\ and some pi £ N by the maximum principle Lemma 8.3.4, 
we obtain from the definition of ?’o and the triangle inequality 

ul(B(x, 2 ro)) C B(p , 3 ~ A+1 p) (8.3.44) 

for every x £ £ and some p £ N (depending on x). 

Having constructed Uq for 1 < £ < A — 1, we construct itg +1 by replacing 
uf, on every disk B(xi,2ro), i £ Ie+ 1 , by the solution of (8.3.33) for x = 
Xi,r = 2 xq. We obtain 

E{u l + l ) < E( U q) (8.3.45) 

and 

u e +\B(x, 2 r 0 )) C B(p, 3 ~ A+e+1 p) (8.3.46) 

for every x £ £ and some p £ N (depending on x) . 

We thus arrive at a map 

Mi := Uq : £ — > N 

with 

E(u i) < E(ip) (8.3.47) 

and 

ui(B(x, 2?’ 0 )) c B(p,p) 
for every x £ £ and some p = p(x) £ N. 
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Having iteratively constructed u n : £ — > N, we construct u n+ 1 by replac- 
ing <p by u n and r 0 by 

r n = sup{i? >0: \/x£E3p£N: u n (B(x, 2R)) C B(p,3~ A p)}. 

The maps (u n ) ra gN satisfy 

E(u n ) < E(u n - 1 ) < E(<p) (8.3.48) 

We now distinguish two cases 

1) s := inf r n > 0 
neN 

We claim that in this case (u ra )neN converges to a harmonic map u : £ — > 
N homotopic to p. 

We shall first show that the u n are equicontinuous. We note that for every 
n, there exist finite sets I \ , . . . , I a and points Xi € E (everything depending 
on n, except for A) with 

A 

£= U U ( 8 - 3 - 49 ) 

1=1 ieh 

B(xi 1 ,2r n ) n B(x i2 ,2r n ) = 0 (8.3.50) 

whenever i\ ^ i 2 ,h,i 2 G h for some £, by Lemma 8.3.6 again. 

By (8.3.49), for every x £ £, there exists some i € (JjLi h w if^ 

B(x,s ) C B(xi,2r n ) (8.3.51) 

There exists £,1 < £ < A, with i £ I(. Therefore 

,.e 

U n\B(x,s) 

is harmonic, since it is even harmonic on the larger disk B{xi,2r n ) (u^ is 
constructed in the manner as Uq with u„ instead of ip.) 

Given e with 0 < e < p we consider S with 

0 < <5 < min(l,s) (8.3.52) 

and 

/ 8nE(tp) 

V logi 

For every x £ E, and n £ N there exists Ri(x) with 

5 2 < Ri(x) < S 

and some p\ £ N with 

u e n (dB(x, Ri(x))) C B{p !,3~ A £) (8.3.54) 

by Lemma 8.3.5. Here £ is chosen as in (8.3.51), i.e. so that i £ It for the i 
occuring in (8.3.51). 
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Since 

t 

^ n\B(x,Ri(x )) 

is harmonic and energy minimizing from Lemma 8.3.4 and (8.3.54) 

u e n (B(x, Ri{x))) C B(pi, 3~ a s). (8.3.55) 

We likewise find R 2 (x ) with 


S 3 < R 2 ( x) < S 2 


and 

u e + 1 (dB(x,R 2 (x))) C B{p 2 ,3 A e) 


for some p 2 G N. u ^, +1 need no longer be harmonic on B(x,R 2 (x)). It is only 
piecewise harmonic in case 


7 := B(x, R 2 (x)) n [J dB(xi, 2 r n ) 0. 

i€le + i 


Since 

4 +1 (7) = u e n ( 7 ) C B{p 1: 3 ~ A £ ) 


and 

u t + 1 {^ndB{x,R 2 {x))) C B{p 2l 3~ A £), 


we obtain 

ui j ~ 1 {'7'AdB(x,R 2 {x))) C B(p 2 ,3~ A+1 £). 


Therefore, the image of the boundary of every subregion of B(x, R 2 (x)) on 
which ?4 +1 is harmonic is contained in B(p 2 , 3 _yl+ 1 e), and since of course 
all maps are energy minimizing on these subregions, Lemma 8.3.4 gives as 
usually 

u e n +1 (B(x,R 2 {x))) C B(p 2 ,3~ A+1 £). (8.3.56) 

Iterating, 

we obtain 

R(x) > S A 

(8.3.57) 

and p = p{x) € N with 



u n +i(B(x, R(x))) C B{p,e) 

(8.3.58) 


(note u n+ 1 = u A ). 

This proves equicontinuity, since S and A are independent of u and x. 
Therefore, after selection of a subsequence, (u„)„ 6 n converges to some 
map u homotopic to p, and by (8.3.48) and lower semicontinuity of E (cf. 
Theorem 8.4.2)) 

E(u) < lim E(u n ) < E{ip). (8.3.59) 

n—* oo 

We want to show that u is harmonic. 
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Replacing r n by s, we may assume that the points Xj, i G Die, are inde- 
pendent of n. (One may assume, by selecting a subsequence, that the points 
Xi{n ) converge to points x*, and also r n — > s as n — > oo.) 

We first claim that with (u ra )„ e N also (u* ) n gN converges to u, and that u 
is harmonic on every disk _B(x,,s) for i G I[ . 

Since 

E(u n + 1 ) = E(v,n) < E(v} n ) < E(u n ), (8.3.60) 

lim (%) - £(«*)) = 0. (8.3.61) 

n—> oo 

Therefore, on each disk B(xi,s),i G I\ 1 for sufficiently large n the energy 
of u n deviates only by an arbitrarily small amount from the energy of the 
energy minimizing map 

U n\B(xi,s)- 

Consequently, considering the gradient DE of the energy as in 7.2, we obtain 

DE(u n \B( XitS )) — > 0 for i G Ii (see (7.2.3) for comparison). 

Since the maps u n converge uniformly, the same argument as in the proof of 
Theorem 7.2.1 shows that 


^1 B(xi,s) — lim U n l B(xi,s) 
n—> OO 

is harmonic (and energy minimizing), and then also 

u \B{ Xi ,s) = lim uh B , s) (8.3.62) 

n—>oo 1 v ’ 

(i e Ji). 

Having iteratively shown that (u^)„ £ n f° r some £, 1 < t < A—l, converges 
to u and that u is harmonic on every disk B(xi , s) for i G I(, we show in the 

same manner that (u( [ +1 ) Il gN likewise converges to u and that u is harmonic 

on every disk B(xi,s),i G h+i- 

We conclude that u is harmonic on B(xi,s) for every i G Ie and every 
t G {1, . . . , A}, hence on all of E. 

2) The second case is 

inf r n = 0. 

new 

By selecting a subsequence, we may assume that (r n ) n6 N is monotonically 
decreasing and converges to 0. 

By definition of r n , for every u, there exist points yo,Ui G E with 

d(yo, yi) = 2r n (8.3.63) 

d(u n (y 0 ),u„(yi)) > 3 _1 p = ; Po (8.3.64) 

We choose local coordinates around yo and denote the coordinate represen- 
tations of yo and y\ again by yo and yi resp. 



438 8. Harmonic Maps 


For z £ C, we put 

k„(z) := y 0 + r n z 

whenever this defines a point in our coordinate chart, and 

u n (z) := u n {k n {z)). 


We thus have maps 

• El n * N 

with C C and f2 n — > C as n — * oo (i.e., in the limit, the domain of 
definition of k n becomes the whole complex plane C, since r n — > 0) . Since k n 
is conformal, the maps u n are piecewise harmonic in the same manner the 
maps u n are (see Corollary 8.2.3). 

The maps u n now are equicontinuous by the same argument as in case 1 
for s = 1 because for every Wq £ (with B(w o, 2) C ft n ) there exists p £ N 
with 

Un(B(w 0 , 2)) C B(p, 3" V), (8.3.65) 

by definition of r n , because k n (B(w 0 ,2)) is a ball of radius 2 r n (w.l.o.g., we 
may assume that the chosen metric on S coincides with the Euclidean one 
on our coordinate chart around yo, as a different metric would only introduce 
some fixed factor in our estimates for the ball radii on S and I7 ra .). 

Likewise, as in case 1, after selection of a subsequence the maps (u n ) 
converge uniformly on compact subsets to a harmonic map 

v:C -> N. 


Moreover, by Corollary 8.2.4 

E(u n \ Q n ) E(u n \k n (fz n )') 

< E(u n ), 

< E(<p), 

hence by lower semicontinuity of E (Theorem 8.4.2) 

E(y) < liminf E{u n ) < E(ip). 

n—> oo 

The holomorphic quadratic differential associated to v, 

9 jk{v{z))v 3 z v k z dz 2 

(( gjk ) being the metric of N in local coordinates), therefore defines a holo- 
morphic function 

■= 9jk(v(z))v J z v z 

of class L 1 , because 

J M < 2 E(v). 

c 
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Since every holomorphic function on C of class L 1 vanishes identically (this 
follows by applying Lemma 8.3.7 below to the real and imaginary parts of 
ip), we get ip = 0, and consequently v is conformal (see the discussion in 8.2). 

It remains to show that v extends as a harmonic (and then also conformal) 
map 

v:S 2 ^N, 

i.e. that the singularity at oo is removable. This will be achieved in § 8.5. □ 

Corollary 8.3.1 Let N be a Riemannian manifold with it 2 (iV) 0. Then 
there exists a nonconstant conformal harmonic v : S 2 — > N, i.e. a (paramet- 
ric) minimal 2-sphere in N. 

Proof. Since n^iN) yf 0, there exists ip : S 2 —> N which is not homotopic to 
a constant map. By Theorem 8.3.5 either ip is homotopic to a harmonic map 
v : S 2 — ► N which then is also conformal by Corollary 8.2.5, or if the second 
alternative of Theorem 8.3.5 holds, there also exists a conformal harmonic 
v:S 2 ->N. □ 


Lemma 8.3.7 Any harmonic function h defined on all K™ and of class 
L 1 (K") is identically zero. 


Proof. By the mean value property of harmonic functions on R", 


IM^o)! = 


Vol(B(x 0 ,R)) 


j h(x) dx for any M > 0, xq € R n . (8.3.66) 


\B(x 0 ,R) 


fh(x) dx < fm\d*<m\w 


\B(xq,R ) 


B(xq,R) 


the r.h.s. of (8.3.66) tends to 0 as R — > oo. Thus h(x 0 ) = 0. This holds for 
any x 0 eR". □ 


Perspectives. Theorem 8.3.1 is due to Lemaire[168] and Sacks and Uhlen- 
beck[210]. Theorem 8.3.5 is again due to Sacks and Uhlenbeck[210]. Other ap- 
proaches to these results were found by Struwe[233], Chang[38] and Jost, see [131]. 

The method of M. Struwe and K.C. Chang consists in studying the associated 
parabolic problem. Thus, given <p : E — > N, one studies solutions of 

/ : E x [0, oo) — > N 
f(z,0) = p(r) 
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where the tension field is computed w.r.t. the z variable. One can then show that 
a solution can develop at most finitely many singularities. These singularities cor- 
respond to the splitting off of minimal 2-spheres. In the limit t — > oo, one obtains 
a harmonic map /. 

The construction presented here is refined in [131]. There, also various existence 
results for unstable harmonic maps are presented. Any type of critical point theory, 
e.g. Morse theory, for harmonic maps in two dimensions has to take the splitting 
off of minimal 2-spheres into account. In certain instances, however, one may show 
that this phenomenon can be excluded. A prototype of such a result is the following 

Theorem. Let £ be a compact Riemann surface with boundary, N a Riemannian 
manifold diffeomorphic to S 2 (thus, the condition TT 2 (N) = 0 is not satisfied). Let 
g : dE — ► N nonconstant. Then there exist at least two harmonic maps /i, fa : E — > 
N with fi\ gs = g. 

This result is due to Brezis and Coron[31] and Jost[127]. 

In order to prove this theorem, one first minimizes the energy over all maps 
/ : E — > N with = g and obtains a harmonic u (see the remark after Theorem 

8.3.3). By careful comparison constructions one then exhibits another homotopy 
class a of maps from E to N (not containing u) with 

inf{I?(/) : f £ a} < E(u) + Area (N). 

One then shows that if minimizing energy in some homotopy class leads to the 
splitting off of a minimal two-sphere, the energy would be lowered by an amount 
of at least the energy of that minimal sphere. Since N is diffeomorphic to S 2 , the 
energy of such a minimal sphere would be at least the area of N. Since, however, 
u realizes the absolute minimum of energy among all maps with the prescribed 
boundary values, the above inequality excludes the splitting off of a minimal 2 - 
sphere during the minimization of the energy in the class a. 

We have described the preceding argument in some detail because it forms 
a paradigm for other conformally invariant variational problems (Yang-Mills equa- 
tions in four dimensions, constant mean curvature surfaces, Yamabe problem, etc.). 
Some further discussion of such limit cases of the Palais-Smale condition may be 
found in [234] and in the references given there. 

Returning to the critical point theory for two dimensional harmonic maps, we 
also mention Ding[56] and the survey article [132] where many further references 
can be found. 

In this context, we should also discuss the Plateau problem for minimal surfaces. 
In its simplest form, we consider a smooth (or, more generally, a rectifiable) closed 
Jordan curve 7 in R 3 and seek a minimal surface with boundary 7 . In the parametric 
version of the problem, we look for a harmonic and conformal f : D —> R 3 ( D = 
unit disk) mapping dD monotonically onto 7 (a monotonic map between curves is 
defined to be a uniform limit of homeomorphisms). In this form, the problem was 
solved by J. Douglas and T. Rado. The problem was then extended by Douglas to 
configurations of more than one disjoint curves 71 , . . . , 7 *, and/or minimal surfaces 
of other topological type. He found a condition (the so-called Douglas condition) 
guaranteeing the existence of minimal surfaces of some prescribed topological type. 
It was also asked whether one may find unstable minimal surfaces with prescribed 
boundary. The most comprehensive critical point theory for minimal surfaces in R 3 
was developed in Jost and Struwe[144] where also references to earlier contributions 
are given. 
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The Plateau problem in Riemannian manifolds (instead of just R 3 ) was solved 
by C. Morrey[189]. Results pointing into the direction of a general Morse theory 
for minimal surfaces in Riemannian manifolds may be found in Jost[131]. 

There also exists the geometric measure theory approach to minimal surfaces. 
Here, one tries to represent a minimal surface not as the image of a map of a 
Riemann surface, but directly as a submanifold of the given ambient space. In the 
parametric approach, one had to generalize the space of smooth maps to a Sobolev 
space, in order to guarantee the existence of limits of minimizing sequences. For 
the same reason, in the measure theoretic approach, the space of submanifolds has 
to be generalized to the one of currents. A submanifold of dimension k yields a 
linear functional on the space of differential forms of degree k by integration, and 
so the space of fc-currents is defined as a space dual to the one of fe-forms. One 
may then minimize a generalized version of area, the so-called mass, on the space 
of currents. This approach is valid in any dimension and codimension, in contrast 
to the parametric one that is restricted to 2 dimensions. If the codimension is 1 
and the dimension at most 7, then such a mass minimizing current is regular in 
the sense that it represents a smooth submanifold. Otherwise, singularities may 
occur. In particular, any smooth Jordan curve in R 3 bounds an embedded minimal 
surface, see Hardt and Simon[lll]. For a general treatment of the concepts and the 
approach of geometric measure theory, we recommend Federer[73] and Almgren[6]. 

Minimal surfaces in Riemannian manifolds have found important geometric 
applications. Let us mention a few selected ones. 

In the proof of the Bonnet-Myers Theorem (Corollary 4.3.1), we have seen how 
information about geodesics and their stability can be used to reach topological con- 
sequences for manifolds of positive Ricci curvature. This suggests that information 
about the stability of minimal surfaces may likewise be used to obtain restrictions 
on the topology of positively curved manifolds. The first instance of an important 
application of minimal 2-spheres in the presence of positive curvature is Siu and 
Yau[227j. Micallef and Moore[178] showed that minimal 2-spheres can be used to 
prove that any compact Riemannian manifold with positive curvature operator (i.e. 
R(-, •) = i? 2 (M) — > Q 2 {M) is a positive operator; this in particular implies positive 
sectional curvature) is diffeomorphic to a sphere. Also, the sphere theorem (see 
Short survey on curvature and topology, above) was proved under the assumption 
of pointwise pinching only (i.e. at each point, the maximal ratio between sectional 
curvatures is less than 4). 

There are also important applications of minimal surfaces in three-dimensional 
topology. The so-called Dehn Lemma, whose first complete proof was given by Pap- 
kyriakopoulos, asserts that if S' is a differentiably embedded surface in a compact 
differentiable three-manifold M and if 7 is an embedded curve on S that is ho- 
motopically trivial in M (i.e. [7] = 0 £ m(M)) then 7 bounds an embedded disk. 
Meeks and Yau[177] showed that in this case, if we equip M with a Riemannian 
metric in such a way that S is convex, the solution of the parametric Plateau prob- 
lem with boundary 7 is embedded. Thus, one obtains an embedded minimal disk 
bounded by 7. This represents an analytical proof of Dehn’s Lemma. The important 
fact is that we have found a canonical solution of the problem. Assume for example 
that some compact group G acts on M, leaving 7 invariant. One may then average 
the metric of M under the action of G and obtain a new Riemannian metric on 
M for which G acts by isometries. Since 7 is G-invariant, one may then also find a 
G-invariant minimal disk bounded by 7. If one chooses this disk to be area mini- 
mizing in its class, one may then show again that it is embedded. This equivariant 
version of Dehn’s Lemma of Meeks-Yau then has applications to the classification 
of discrete group actions on 3-manifolds, see [14]. 
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The Poincare conjecture asserts that any three-dimensional manifold M with 
7Ti (M) = 0 is diffeomorphic to the three-sphere S 3 . In order to approach this 
problem, it would be useful to obtain detailed information about the topology of 
possible counterexamples, in order either to construct them or to exclude their 
existence. Such counterexamples are called fake spheres. A fake 3-sphere has to 
contain a fake 3-ball, i.e. a manifold B with tti(B) = 0 and a 2-sphere as boundary. 
Meeks, Simon and Yau[176] proved that any such fake 3-ball can be surrounded by 
an embedded minimal 2-sphere which is locally area minimizing (here, again, we 
may equip M with any Riemannian metric). Thus, the possible presence of a fake 
3-ball leads to a tangible geometric consequence. So far, however, this line has not 
been pursued any further, and the Poincare conjecture is presently approached by 
the Ricci curvature flow in the work of Perel’man, as described in the Survey on 
Curvature and Topology after Chapter 4. 


8.4 Definition and Lower Semicontinuity of the Energy 
Integral. 


For the analysis of harmonic maps, it is necessary to consider classes of maps 
more general than C 1 . A natural space of maps is L 2 (M , N). One then needs 
to define the energy integral and derive conditions for a map to be a critical 
point of that integral. 

The idea of defining the energy functional is quite simple and may be 
described as follows: 

We let, for h > 0 

ah '■ R + — > R 

be some nonnegative function with cr^fs) = 0 for s > h and 


j a h (\x\) dx = 1 
B(0,h) 

where 13(0, /i) is a ball of radius h in R m (m will be the dimension of our 
domain M in the sequel). For x,y € M, we put 

Vh{x,y) := a h (d(x,y)). (8.4.1) 


The typical example we have in mind is 


Vh(s) 


l hm for 0 < s < h (com = volume of the unit ball in R m ) 
0 for s < h, 


(8.4.2) 

and so, r]h(x, •) is a multiple of the characteristic function of the ball B(x, h), 
for every x. That multiple is chosen so that the integral of r]h(x,-) w.r.t. 
the Euclidean volume form dy on B{x 1 h ) is 1, i.e. the one induced from the 
Euclidean volume form on T X M via the exponential map exp a , : T X M — > 
M. We note that by Theorem 1.4.4, the difference between the Euclidean 
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and Riemannian volume forms is of order 0(h m+2 ). The advantage of the 
Euclidean volume form is that the normalization does not depend on x so 
that r)h becomes symmetric in x and y. 

For a map f £ L 2 (M, N ) between Riemannian manifolds M and N, we 
then define 

Eh(f) ■= J J Vh(x, y) — dvol (y) dvol(x) (8.4.3) 

M B(x,h) 

where dvol is the Riemannian volume form on M. 

In order to understand the geometric meaning of the functionals Eh, we 
observe 

Lemma 8.4.1 f : M —> N minimizes Eh iff fix) is a center of mass for the 
measure f#(ph{x,y) dvol (y)) for almost all x £ M, i.e. if f(x) minimizes 

F(p)= J Vh{x,y)d 2 {p,f{y))dvo\{y) 

B(x,h ) 


Proof. If f{x) did not minimize F{p), then 

J ii h {x,y)d 2 {f(x),f(y)) dvo%) 

B{x,h ) 

could be decreased by replacing f(x) by some minimizer p. Since ph{x,y) is 
symmetric, that would also decrease Eh(f) if happening on a set of positive 
measure. □ 


It is also instructive to consider the following computation that leads to 
a proof of Lemma 8.4.1 in the smooth case. We consider variations 

ft(x) = f(x)+t<p{x) 

of /. If / minimizes Eh, then 

° = Jt Eh ^ |t=0 = P Jf J J Vh{x,y)d 2 ift(x), ft{y)) dvol(y) dvol(a;) 

= Jfd J J Vh(x, y){S7id 2 {f{x),f(y)){(p{x))+ 

S/ 2 d 2 ifix),fiy)){ipiy))} dvol (y) dvol(a:) 

= P / / dvol( P 

because of the symmetry of ph 

J J Vh(x,y)exp-( x) f(y)(p(x) dvol(y) dvol(x) 


2 

h 2 
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Since this has to hold for all smooth ip with compact support, 

J Vh(x, y) exp”^ f(y) dvol(y) = 0 

for all x. Thus f(x ) is the center of mass of f#{rih{x,y) dvol(y)). 

We now consider the functionals E e for h = e with the kernel rj e defined 
by (8.4.1), (8.4.2), and we let e — > 0 and define the energy E as the limit 
of the functionals E e . The functionals E e increase towards E, and it is not 
excluded that E(f) takes the value oo for some / £ L 2 (M,N). We shall 
see that E coincides with the usual energy functional for those mappings for 
which the latter is defined. Also, the functionals E e are continuous w.r.t. L 2 - 
convergence, and the limit of an increasing sequence of continuous functions 
is lower semicontinuous. We shall thus obtain the lower semicontinuity of the 
energy w.r.t. L 2 -convergence. 

Actually, the described monotonicity of the sequence E e as £ — » 0 only 
holds up to an error term that comes from the geometry of M. It is not hard 
to control this error term sufficiently well so that the desired conclusion about 
E can still be reached. 

Lemma 8.4.2 E e (f) is continuous on L 2 (AI,N), i.e. if {f v )v £ n converges 
to f in L 2 (M,N), then 

E e (f) = lim E e (f v ). 

V — KX) 

Proof. Elementary. □ 

We estimate for 0 < A < 1 
EeU) 

= ( 77^+2 / / d2 U{x)J{y))^o\{y)dvo\(x) 

M B(x,e ) 

~ u> m e m+2 I I { d (f( x ^f( x + X ( y ~ x ^ 

M B(x,e ) 

+ d(f(x + A (y - a;)), /(y))| dvol(y) dvol(a;) 

(by the triangle inequality) 

/ {d 2(/(;r) ’ /(:c+A(2/ “ x))) 

M B(x ,£ ) 

+ Yh. d2{f{x + X( ^ y ~ x ^’ dvo1 ^) dvol (^) 
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(using the inequality (o + b) 2 < x a? + , valid for any real numbers a, b). 

In local coordinates, with metric tensor {gij), we have 

dvol(y) = det(gij)^ dy 1 . . . dy m . 

By Corollary 1.4.3, we may assume that e is so small that Riemannian normal 
coordinates may be introduced on B(x, e). In those coordinates, we have from 
Theorem 1.4.4 that 

det (gij(y))% = 1 + 0(e 2 ) for y G B(x,e). 


Therefore 


dvol (y) 
dvol(A y) 


j-(! + °(e 2 ))- 


We then substitute z = Xy and obtain (noting that x has the coordinate 
representation 0) 


/ ^d 2 (/( 0), /(Aj/)) dvol(y) = -^-(l + 0(e 2 )) J d 2 (/(0), /(*)) dvol(*). 

B(x,e) B(x,\e) 


In that manner, we obtain 

E e{f)< - l +2 ^)){j ^ J d 2 (f(x),f{z))dvol{z)dvol{x) 

M B(x,\e) 

+ J ( 1 _\)m / d 2 (/(x), f{y)) dvol(y) dvol(z)} 

M B(z,( 1-A)e) 

= (1 + 0(s 2 ))(\E XE (f) + (1 - A )E {1 _ x)e (f)). (8.4.4) 

We put 

E n (f) := E 2 -n(f). 


Definition 8.4.1 The energy of a map f G L 2 ( M. N) is defined as 

E{f) = lim E n {f) = lim E e {f) G R U {+oo}. (8.4.5) 

n—> oo £ — >-0 

We also say that / G L 2 (M,N) belongs to the Sobolev space H 1 ’ 2 (M, N) if 
E(f) < oo. 


In order to make contact with more classical definitions of Sobolev spaces, 
we start with the following 


Definition 8.4.2 A map f : M — > N between manifolds is localizable if for 
every £0 G M there exists a neighborhood U of £0 in M and a domain V of 
a coordinate chart in N with the property that 
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f(U) c V. 

In the sequel, we shall look at maps which are localizable in the sense of Def- 
inition 8.4.2. For such maps, all relevant regularity properties can be studied 
in local coordinates. In particular, it can be defined with the help of local 
coordinates whether such a map between Riemannian manifolds is of Sobolev 
class H 1,2 (M , TV). 

We now want to establish the result that for such localizable maps, our 
general definition of the energy coincides with the one obtained by local 
coordinate representations. 

Theorem 8.4.1 For a localizable map f £ L 2 (M, TV), 

E(f) = d{m) J ( df , df) dvol(:r) 

M 

whenever the latter expression is defined and finite (where the weak derivative 
df is defined with the help of local coordinates) , and 

E(f) = oo 


otherwise. 

Here, d{rn) is some factor depending on the dimension of AI that can be 
safely ignored in the sequel. 

In the proof of Theorem 8.4.1, we shall employ the following auxiliary 
result: 

Lemma 8.4.3 For a localizable f, f £ i? 1,2 (M, TV) (M, TV compact) iff for 
all Lipschitz functions t : TV — > M, l o / £ H 1,2 (M, R). 

Proof. We have assumed / to be localizable, and so the i? 1,2 -property may be 
tested in local coordinates. Therefore, if the T4 1,2 -property holds for compo- 
sition with Lipschitz functions it holds for coordinate functions. Conversely, 
if / is in Ft 1 ’ 2 , then (.of is also in H 1,2 for all Lipschitz functions ( by Lemma 
A. 1.3. □ 

Proof of Theorem 8.4-1: For f £ C7 1 , it is an elementary consequence of 
Taylor’s formula that 

E(f) = UmE e (f). (8.4.6) 

£ — ►O 

For / £ Ft 1 ’ 2 (defined with the help of local coordinates), we choose a se- 
quence (fv)ven C C l converging to / in H 1 ’ 2 . Given 5 > 0, we find n 0 such 
that for all n, p, > u 0 
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\E(f u ) - E(f)\ < d ~. (8.4.7) 

We write 

Ee(fu) ~ EM = — ^ [ [ (d 2 (Mx),U(y)) 

J J 

M B(x,e) 

- d 2 (fn(x)’fv(y))) dvol (2/) dvol (a:) 

= i£^j J 

M B(x,e ) 

- d(U(x), U{y)))d(f v (x),f v (y)) dvol (y) dvol (a:) 

M B{x,e) 

- dvol(y)dvol(a:) . 

Now 


d(f„(x), U(y)) = J D 2 d(f u (x), f v {x + t(y - x)))(y - x)dt (8.4.8) 
o 


(for almost all y) where D 2 denotes the derivative w.r.t. the second variable, 
and we use local coordinates on B(x, e). This derivative exists a.e. by Lemma 
A. 1.3 since d is Lipschitz. 


Consequently 

1 , f 


tOmS m+2 


(d(fv(x), fv(y)) 


< 


M B(x,e) 

- d(fMJ^y)))d(U(x), f v {y)) dvol (y) dvol(z)| 
1 

r 1 [ I fv(x + t{y - x))) 


U>m£ m+2 


M B(x,e) 0 

- D 2 d(f lx (x)’fu( x + t(y - £)))| 2 |2/ - x\ 2 dt dvol(y) dvol (a;) 

/ d 2 (f v (x), f„(y)) dvol(y) dvol(x)| 2 , using Holder’s inequality 


M B(x,e) 


<E e (f v )t i _ { [ [ ( [ \D 2 d(f v {x),f v (x+t(y - x))) (8.4.9) 

“™ £2 J mbL)o 

1 

2 d(Mx),fv(x + t(y - a:)))| 2 dtdvol(j/)dvol(a ;)| 2 . 


-D 2 
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Since (/„) converges in H 1 ’ 2 , by Lemma 8.4.3 then (D 2 d(f u (x), 
converges in L 2 for every x. Therefore, given y > 0, there exists v\ > vq so 
that for all v, p > zq , the preceding expression is bounded by 

VE s (frf 

(For M a compact Riemannian manifold, and an integrable function 

ip: M — » R, f M f B ^ x f Q ip(x + t(y — x))dt dvol (y) clvol(x) behaves like 

w m £ m J M <p(z) dvol(z) as £ -> 0.) 

We thus obtain 

I E e {f u ) - E e (fn)\ < V (E s (frf + E e {frf). (8.4.10) 

From (8.4.8), we see that E E {f v ) is controlled by the energy E{f v ) and since 
the latter is bounded since it converges to E(f), we may assume 

E e (fv) < K for some constant K and all v. 

Hence by a suitable choice of r\ in (8.4.10), we have for all y, > u\ 

\E e (f v ) - E e (U)\ < d ~. (8.4.11) 

We then choose e > 0 so small that 

\E s (f^) - E(f Vl )\ < 6 - (8.4.12) 

which is possible by (8.4.6). From (8.4.7), (8.4.11), (8.4.12), we conclude 

| E e (f)-E(f)\<S 

for all sufficiently small e. This is the claim for f £ H 1,2 . 

In order to establish the result for general (localizable) f £ we 

show that if E e (f) stays bounded for £ — » 0, then / £ H 1,2 (M, N). For that 
purpose, we use the characterization of Lemma 8.4.3. 

Let £ : N — > R be Lipschitz. If E e (f) is bounded, so then is 

E e (£of) = f j \£ o f(x) — £ ° f (y)\ 2 dvol(y) dvol(;r). 

J J 

M B(x,e) 


Introducing Riemannian polar coordinates (r, p) on B(x,e) ( e sufficiently 
small, cf. Corollary 1.4.3), we compute 




\tof(x + ey) - £o f(x)\ 2 £ m 
72 -^dydvolfc), 


M B( 0 , 1 ) 


up to an error term that goes to 0 for £ — > 0. Since this is assumed to be 
bounded as £ — > 0, for almost all y £ B(0, 1), the difference quotients 
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AU£of)(x) = 


Zof(x + ey)-Zo f(x) 


are uniformly bounded in L 2 . By Lemma A. 2. 2, we conclude that Zo f £ H 1 ’ 2 . 
Since this holds for every Lipschitz function Z, by Lemma 8.4.3, f £ H 1,2 . 
This completes the proof. □ 

We now want to show the lower semicontinuity of the energy E w.r.t. 
L 2 -convergence . 

Theorem 8.4.2 If (/„)„eN converges to f in L 2 (M,N), then 

E(f) < liminf E{f v ). 


Proof. We may assume 

liminf E(f v ) < oo, 

v — >oo 

hence also 

E(U) < I< (8.4.13) 

for some constant K and all v. By definition 

E(f) = lim E n (f). 

n—> oo 

Given 6 > 0, there then exists n 0 such that for all n > n 0 

E(f)<E n (f)+S. 

By Lemma 8.4.2, E n is continuous on L 2 . Hence there exists v$ such that for 
all v > v o 

E(f) <E n °(f„) + 26 (8.4.14) 

(vq depends on S and no). 

Applying (8.4.4) with A = we obtain 

E n (U) < (1 + 0(2 ~ 2n ))E n+1 (f„). 

Possibly choosing no larger, we obtain for all n > no 

E n {f v ) < E{f v ) + <5 (8.4.15) 

using (8.4.13). 

(8.4.14) and (8.4.5) imply 

E(f) < E(f v ) + 3<5 for all v > u 0 . 

Since 6 > 0 was arbitrary, the claim follows. □ 

We now wish to relate the above results to a general concept of varia- 
tional convergence, the T-convergence in the sense of de Giorgi. In order to 
introduce that concept, let Z be a topological space satisfying the first axiom 
of countability 6 ; that means that for every x £ Z, we may find a sequence 


this is assumed only for the simplicity of presentation; the concept is meaningful 
also for spaces that do not satisfy the first axiom of countability; one has to 
replace sequences by filters in that case. 
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(C/ !y ) l/S N of open subsets of Z such that every open set containing x also con- 
tains some [/„. In our applications, Z of course will be L 2 (M,N) or some 
subspace of that space. 

Let 

F n : Z — > R. U {±oo}, n G N, 
be a sequence of functionals. 

Definition 8.4.3 The functional 

F : Z — > R. U { Too} 
is the P-limit of (F n )n£N, written as 

F = r — lim F n 

n — >N 


if 

(i) whenever (a; n )„gN C Z converges to x £ Z, 

F(x) < liminf F n (x n ) 

nGoo 

(ii) for every x G Z, we can find a sequence (x n ) n ^ C Z that converges 
to x and satisfies 

F(x) = lim F n (x n ). 


Lemma 8.4.4 E = r — lim.E e w.r.t. L 2 -convergence. 

Proof. By monotonicity (see (8.4.4)), it suffices to show the result for E n 
instead of E e . 

(i) For every / G L 2 (M, N), there exists a sequence (/^sk C L 2 (M , N) 

E{f) = lim E v (f v ). 

is— >oo 

According to the definition of E , we may simply take f v = f for all 
v. 

(ii) For every sequence C L 2 (M, N ) converging to / we have 

E(f) < lim inf ^(/„). 

IS— too 

From the definition of E, for any 8 > 0 there exists no G N such that 
for v > no 

E(f) < E"(f) + 8. 

Using this estimate and that E u is continuous on L 2 by Lemma 8.4.1, 
we may find vq (depending on 8 and no) such that for v > vq 


E(f)<E n °(f v ) + 26. 
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From (8.4.2) with A = we get 

E n {f v ) < (l + c2~ 2n )E n+1 (f„), 
for some constant c, depending on the geometry of M. 

We may have chosen no in the preceding also satisfying 

H (1 + c2~ 2n ) <1 + 5. 

n>no 

Then from the preceding estimate 

E no (U) < (1 + S)E-'(U) for i/ > n 0 . 

Putting the estimates together, 

E(f) < (1 + S)E v {f v ) + 26 for v > ?r 0 , n 0 . 

As this holds for any S > 0, 

E(f) < liminf E v (f v ). 

V — MX) 

□ 

This result is quite useful, because, in view of the next lemma, it tells us 
that if for some sequence e n — > 0, we can find a minimizer /„ for every E £n 
and if this sequence converges to some /, then / automatically minimizes 
E. In other words, we can find a minimizer for E by minimizing the simpler 
approximating functionals E s . 

Lemma 8.4.5 Let 

F = r — lim F n 

n — >oo 

in the above setting. Assume that every F n is bounded from below, and that 
x n minimizes F n . If x n converges to x £ Z, then x minimizes F, and 

F(x) = lim F n (x n ). (8.4.16) 

n—> oo 


Proof. Let z £ Z. 

Since F is the T-limit of the F n , we can find some sequence (2 n ) n eN 
converging to z with 

lim F n (z n ) = F{z). 

n—> oo 

Given e > 0, we choose n £ N so large that 

F n (z n ) < F(z) + | 

and also 
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(property (i) of /^-convergence) 


F n (x n ) > F(x) - - 


Since x n minimizes F n , 


Altogether 


F n (x n ') A F n (Zn). 


F(x) < F(z) + £. 

Since this holds for every z £ Z and every e > 0, x minimizes F. By _T- 
convergence 

F(x) < liminf F n (x n ), 

n—* oo 

and we may find a sequence (z ra )neN converging to x with 

F(x) = lim F n (z n ). 


Since 

Fn(Xn) ^ F n (Zn) 

because of the minimizing property of x n , (8.4.16) follows. 


□ 


A-limits are automatically lower semicontinuous, and so, we could have 
deduced Theorem 8.4.2 from that general result about T-convergence. 

Perspectives. The definition and treatment of the energy functional presented 
here are taken from Jost[133]. (See also [134].) A similar theory is developed by Ko- 
revaar and Schoen[162]. For the usual definition of the Sobolev space H 1,2 (M, N), 
see exercise 8). The concept of T-convergence is treated in dal Maso[53] and Jost, 
Li-Jost[142]. 


8.5 Weakly Harmonic Maps. Regularity Questions 

Definition 8.5.1 / £ M,N being Riemannian manifolds, is a 

critical point of the energy integral E if 

^E(exp f tip)\ t = o = 0 (8.5.1) 

whenever if) is a compactly supported bounded section of f~ l TN of class 
H 1 ' 2 , i.e. 

J (dtp , dtp) dM < oo 

M 

(Cf. (8.1.11) for the definition of dtp ; all partial derivatives are to be under- 
stood as weak derivatives.) 
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Lemma 8.5.1 f £ // (o ’ c (M.N) is a critical point of E iff 

j (df, dif)dM = 0 (8.5.2) 

M 

for all if as in Definition 8.5.1. 

Proof. This follows from the computation of ^E(ex pj tip) leading to (8.1.13). 

□ 

Definition 8.5.2 A solution of (8.5.2) is called weakly harmonic. 

Corollary 8.5.1 The weakly harmonic maps are the critical points of E. □ 
Lemma 8.5.2 / £ H^(M, N) is weakly harmonic if in local coordinates 

( 8 -«> 

M 

= -/ (/(*)) ^ ^n^dx 1 ...dx m 

M 

for all rj £ Hf 2 n L°° (w.r.t. local coordinates) . 

Proof. This follows from the proof of Lemma 8.1.1 and the derivation of 
(8.1.13). □ 

Remark. 

1) Under coordinate changes g = g{f ) in the image, transforms into rj 
with 

df 

* “ W Vl - 

With this transformation behaviour, (8.5.3) is invariantly defined. 

2) The only variations that we shall need in the sequel are of the form 

if{x) = s(f(x))ip(x) (8.5.4) 

where s is a compactly supported smooth section oi TN and <p is a 
compactly supported Lipschitz continuous real valued function. For 
such if, f £ N) implies if £ H 1,2 by the chain rule. 

In particular, for such variations, (8.5.2) and (8.5.3) are meaningful even 
if / should not be localizable. 
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For the following result, D is the unit disk as usual, and 


D* := £>\{0} 


is the punctured unit disk. 

Lemma 8.5.3 Suppose f G H 1 ’ 2 (D*, R") satisfies 


J Df(z)Dip(z)dz = J g(z,f(z),Df(z))ip(z)dz 

D * D* 


(8.5.5) 


for all 

where g fulfills 


p G Hi' 2 nL°°(D*,K”) 


\g(z,f,p)\ < Co + Ci|p | 2 
with constants Cq,Ci for all ( z,f,p ) G D* x R" x R 2m . T/ien oZso 


Df(z)Da(z)dz = j g{z, f{z) 1 Df{z))a{z)dz 


(8.5.6) 

(8.5.7) 


D 


D 


for all a G H^ 2 D L°°(D,M. n ). 

The lemma says that weak solutions of (8.5.5) with finite Dirichlet integral 
extend as weak solutions through isolated singularities. Easy examples show 
that the assumption of finite Dirichlet integral is essential. 


Proof. For k G N, k > 2, we put 

( 1 for r < ()r) 2 

Afc(r) := < log(£)/log k for(i) 2 <r<i 
[ 0 for r > j: 

and for cr G Hi' 2 n L°°(D, R") 

p k {z) := (1 - A fc (|z|)M*) G Hq' 2 D R"). 

We now observe that 


[ \DXk(\z\)\ 2 dz = 2ir [ (^^) 2 rdr=-^- >0 as k — > oo. (8.5.8) 

J J dr log k 

D (fP 


By (8.5.5), 


J Df(z)D(p k (z)dz = J g(z,f(z),Df(z))(pk(z)dz. (8.5.9) 

D* D* 
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Because of / € H 1,2 and (8.5.6), 

g(z,f(z),Df(z)) G L 1 . 

Since < |cr| G L°° and since ip k converges to a almost everywhere, 
Lebesgue’s theorem on dominated convergence therefore implies that for k — > 
oo, the right hand of (8.5.9) tends to 

J g(z,f(z),Df(z))a(z)dz. 

D 

By (8.5.8), a G L°°, f G if 1 ’ 2 , and by Holder’s inequality, for k — > oo 
J Df(z)D(X k (z))a(z)dz -> 0. 

D 

Therefore, the left hand side of (8.5.9) tends to 

J Df(z)Da(z)dz 

D 

for k — » oo, and (8.5.7) follows. □ 

Corollary 8.5.2 Suppose that £ is a Riemann surface, p G £, N a Rieman- 
nian manifold, f G H 1,2 (£\{p},N). 

If f is weakly harmonic on 17\{p}, then f extends as a weakly harmonic 
map to £. 

Proof. A consequence of Lemmata 8.5.2, 8.5.3. □ 

Remark. Suppose that / : LA{p} — ► N is localizable and of finite energy 

E(f,£\{p})= J ||d/|| 2 <oo. 
z\{p} 

Then we can define the energy of / on £ as 

E(f;£) = E(f;£\{p}). 

The proof of Lemma 8.5.5 shows that this is meaningful. 

Our first aim is to prove the extension result needed in the proofs of 
Thms. 8.3.4 and 8.3.5, namely that a conformal harmonic map C — * N of 
finite energy extends to a conformal harmonic map on S 2 = CU {oo}. While 
the following results are correct even without the assumption of conformality, 
that assumption considerably simplifies the proofs. We divide the proof into 
two steps, first continuity and then smoothness. In order to explain the basic 
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idea of the continuity proof, we first consider an easy special case, namely 
N = R n . We are thus investigating weak minimal surfaces in Euclidean space: 

Definition 8.5.3 A map h G R") from a Riemann surface £ is 

called a weak minimal surface if h is weakly harmonic and conformal, i.e. 

1 ) 

J {h x ip x + h y ipy)dxdy = 0 (8.5.10) 

E 

for all ip G Hi’ 2 n L°°(£, R n ) 

(z = x + iy being a conformal parameter on £) 

and 

2) 

h x • h x = h y ■ h y , h x • hy = 0 almost everywhere. (8.5.11) 


We now show 

Proposition. Any weak minimal surface h G H^ c (£, R") is continuous. 

Proof. Since the result is local, we may assume £ = D, that the point where 
h has finite Dirichlet integral (energy) on D. 

We consider r G (0,1) and 

Zq G D r := {z G C : \z\ < r} 
p := h{z 0 ). 

We assume that for almost all 2 G dD r = {|^| = r} 

\h(z)—p\>p (8.5.12) 

(this means that the minimal surface h{D r ) has no boundary inside the ball 

The plan is to show that if r — > 0 then also p — > 0 for f> satisfying (8.5.1). 
We shall then apply the Courant-Lebesgue lemma to the extent that for 
suitable r, if \h(z) —p\ is small for one 2 G dD r then this is so for all 2 G dD r . 
Continuity will then follow from the triangle inequality. 

We first consider a general compact Riemann surface S with boundary 
dS and a weak minimal surface h G H l,2 (S, R") with 

| h(z) — p\ > p for all 2 G dS. 

We let r] G C 00 (]R) satisfy 

77(f) = 1 for t < \ 

77 (f) = 0 for f > 1 

7? / (t) < 0 for all f 


(8.5.13) 
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and choose as a test vector 

■= — -){h(z)-p) 

P 

for 0 < p < p. 

Because of (8.5.13), p> has compact support in the interior of S. Therefore, 
ip is an admissible test vector in (8.5.10), and thus 

J (h x ip x + hyip y )dxdy = 0 (z = x + iy). (8.5.14) 


We now define 


Mp) -=\j i^iV^J). 


If y is the characteristic function X(-oo,i) °f (— oo,l), A ri (p) is the area of 
the minimal surface h(S) inside the ball B(p,p). We compute 

A 'v(p) = ~^2 J \ Dh \ 2 \ h ~PW( \ h p P \ ) (8.5.15) 


and 

h x tp x + hyify = y(- — ~~)\ Dh\ 2 

+ V '(l^_^)_L_{((/ l _p) . h x f + ({h-p) ■ h y ) 2 } . (8.5.16) 

Since the vectors h x and h y are orthogonal and of equal length by the weak 
conformality of h, we estimate 

1. 


((h - p ) • h x y + ((h - P ) • h y y <-(K + h z v )\h - P i 

= l -\Dh\ 2 \h-p\ 2 


(8.5.17) 


The factor \ will be essential, cf. (8.5.18) below and its consequences. Since 
i i < 0, (8.5.16) and (8.5.17) imply 


h x ip x + hyipy > T]( 


\ h -p\vnu2 I 


'-)\Dh\* + rfV 


P P 

(8.5.14) and (8.5.15) then yield 

2 A n (p) - pA' v (p) < 0, 




2 p 


-\Dh\ 2 . 


(^#y> o, 


hence 


(8.5.18) 



458 8. Harmonic Maps 


and thus for 0 < p\ < p 2 < p 


Ari(pi) ^ A v (p2) 
2 / np\ ~ 2itp 2 


(8.5.19) 


We choose a sequence (rj n ) n6 n of smooth functions with the above prop- 
erties and tending to X(-oo,i)- By Lebesgue’s theorem on dominated conver- 
gence, we obtain in the limit with 

A{p) := Area {h(S) CiB(p,p)) 

the fundamental monotonicity formula for minimal surfaces which we record 
as 

Theorem 8.5.1 Let S be a compact Riemann surface with boundary dS and 
let h £ H 1,2 (S,M. n ) be a weak minimal surface, and suppose 

h(dS)nB{p,p) = Q). (8.5.20) 

Then is a nondecreasing function of p for 0 < p < J>. 

The result also holds for 0 < p < oo if S is a (noncompact) Riemann 
surface and h £ H^ C (S, R") is a proper weak minimal surface. Here, “ proper ” 
means that the preimage of each compact set in M™ is compact in S. 

Proof. The compact case has just been described. The claim for noncompact 
S follows by exhausting S by compact subsets. The properness of h guarantees 
that (8.5.20) is satisfied for sufficiently large compact subsets. □ 


We want to determine whether has a limit as p — > 0. 

Definition 8.5.4 Let T be a surface in a Riemannian manifold N, p £ N, 
A{T,p,p) := Area (T (i B(p, p)). If 

A(T,p,p) 

exists, then this limit is called the density of T at p. 

We observe that if T is closed and p (jL T, then 

d(T,p) = 0. 


If h is a smooth minimal surface, then as a consequence of the Hartman- 
Wintner-Lemma 8.2.6, we have an asymptotic expansion 

h z (z 0 ) = a(z - z 0 ) m 

with some a £ C n (a 2 = 0 since h defines a minimal surface) at every zq with 
some non-negative integer to, cf. Corollary 8.2.6, and 
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m = 0 

for almost all zq, because h z has only isolated zeroes. 

This easily implies 

d(hfS), h(zo)) = to + 1 

and 

d(h(S),h(zo)) = 1 for almost all zo . 
We now return to the case of a weak minimal surface h:S 


Lemma 8.5.4 Let h : S — > K” be a weak minimal surface. Then the (lower) 
density of h(S) at h(z) is at least 1 whenever 

z £ Sq :={y £ S:h is approximately differentiable at y, 

y is a Lebesgue point for \Dh\ 2 , and \Dh(y)\ 2 ^ 0}. 

Consequently, for z £ Sq, 

Area (h(S) fl B(h(z), g)) > 2i tq 1 

whenever 

h(dS) fl B(h(z),g) = 0. 


Proof. By the monotonicity formula (Theorem 8.5.1), we need to show that 
with K e := {a; £ S: \h(x) — h(z)\ < g} 

K e 

Now, with K e g := {x £ D e : \h(x) — h(z) — S7h(z)(x — z ) | < e\x — z\} 

J \dh(x)\ 2 > J \dhfx)\ 2 = J \\7h(x)\ 2 , 

d q K^nSo K^ e nSo 

where Vft denotes the approximate derivative (see § A.l), and we shall control 
the latter quantity from below. 

The domain of integration here is controlled by a radius in the image. In 
order to estimate the integral, however, we shall need to convert that radius 
into a radius in the domain. 

We put 
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Then, for 


x £ B £ (z , r e ) := {y £ B(z, r e ): |/i(ar) — h(z) — X7h(z)(x — z) | < e\x 

| h(x) — /i(,z)| < \Vh(z)(x — z)\ + e\x — z\. 

The conformality relations (8.5.11) now imply 

I Vh{z)(x - z ) | 2 < ^\\7h{z)\ 2 \x - zf 2 . 


Ah 


Thus, we obtain 

\h(x) — h(z ) | < 
for x £ B £ (z,r e ). This implies 


(^l v M^)l + £ ) \ x 


A <Q 


B e (z,r e ) C I< £ e , 

and so, since K £ \ ( K £ D Sq) is a null set, 


27 TQ 2 


r c )'kv^‘ 

J \ VHz) t ~2^\ yh{z)Y 


K e o nS 0 


up to an error term (arising from having B £ (z, r e ) in place of B(z, r e )) which, 
however, goes to zero as g, and hence also r £ tends to 0, because h is approx- 
imately differentiable at z. 

Inserting the value of r e , and letting first g and then e tend to 0, we obtain 

Kg 


The integrand, here, however, is |V/i(z)| , i.e. the value at the center z, and 
not |Vft.(x)| 2 . Thus, in order to complete the proof, we need to estimate 

j \\Vh{z)\ 2 -\Vh(x)\ 2 \dx. 

K^nSo 

Again, we need to translate the radius in the image into one in the domain, 
but this time with an inequality in the opposite direction. 

Wlog e < |V/i(z)|, and so for x £ K £ n S 0 

\x-z\< g(\X7h(z )\ - ep 1 =: R e , 


i.e. 


K £ nS 0 cB(z,R e ). 
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Therefore, 


^7 f \\Vh(z)\ 2 -\Vh(x)\ 2 \dx 
K‘ns 0 

~ (|V/i(z)| — e) 2 2-kR 2 / 

B(z,R e )nSo 


\\7h(x)\ 2 dx. 


If we then let g , and hence R e tend to 0, the last integral also goes to 0 
because z is a Lebesgue point for \dh(z)\ 2 . Thus, the proof is complete. □ 


In order to also include points where h is not approximately differentiable, 
or that are not Lebesgue points for \dh(z)\ 2 , we now claim that the lower 
density 


lim inf 

p—>o 


A(h (S) , h(z) , p) 
2n p 2 


is an upper semicontinuous function of z. 

Let p n — > 0 for n — > oo. 

By the above, we find sequences (z ra )neN C S, (£ n )neN C R, £ n — > 0 as 
n — > oo, 

| h(z) - h(z n ) | = £ n p n - 


Then 


A(h(S),h(z),p n ) > A(h(S),h(z n ), (1 - £ n )Pn) 

27T Pl ~ 27T Pl 

since B(h{z n ), (1 - £ n )pn) C B(h(z),p n ) 


_ A(h(S ') , h[z n ) , (1 £n)Pn) ,2 

■ 2 tt ((1 - £ n )p n y 1 " £n> 

>d(h(S),h(z n ))( l-£n) 2 
by monotonicity at h(z n ), 

and upper semicontinuity follows. 


We now return to the proof of the Proposition and put S = D r . The 
preceding argument, Lemma 8.5.4 and Theorem 8.5.1 say 


for 0 < p < p, unless Vh 


1 < 


Ajp) 

2np 2 


(8.5.21) 


0 locally, which, however, represents a trivial 


case. 
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Since 


and 


m <lj m 2 , 

D r 

lim / \Dh\ 2 = 0 monotonically 

r *o J 


(8.5.22) 


D r 

(it follows by applying Lebesgue’s theorem on dominated convergence to 
fXD r that 

lim / f = 0 

r^0j 

D r 

for any integrable /), we conclude from (8.5.21) that 


p — > 0 as r — » 0. 

This means, by definition of p, 

inf \h(z) — h(zo) | — > 0. (8.5.23) 

z€dD r 

On the other hand, the Courant-Lebesgue-Lemma 8.3.5 says that for any 
r 0 < 1, there exists r with r 0 < r < such that for all z, z' £ d D r 

I Hz) h(z’) I < ~7~~YyT ( / \Dh\ 2 )K (8.5.24) 

0Sr 2 D r o 

and the right hand side goes to zero when rg — » 0, hence r — > 0. 

Let now e > 0 be given. We then find sufficiently small r > 0 so that first 
the right hand side of (8.5.24) is < | and that for every z o £ D r , the infimum 
in (8.5.23) is also < |. For zo,z' 0 £ D ri let then z and z' , resp., be points in 
dD r where the infimum in (8.5.23) is attained. The triangle inequality gives 
\h(zo) — h(z' 0 )\ < e, hence continuity. □ 


We now want to prove continuity of weak minimal surfaces in Riemannian 
manifolds. 


Definition 8.5.5 A map h £ N) from a Riemann surface £ into 

a Riemannian manifold N is called a weak minimal surface if it is weakly 
harmonic and conformal, i.e. 
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1) 


( dh , dip) = 0 


(8.5.25) 


E 

for all compactly supported bounded H 1,2 sections ip of h~ 1 TN 
((•, •) here is the scalar product in T* E <8> h~ 1 TN) 


2) 


( h x , h x ) = (hy, hy ), (h x , hy) = 0 (8.5.26) 

almost everywhere 

((•,•) is the scalar product in h~ 1 TN) 

For 1), cf. Definition 8.5.1 and Lemma 8.5.1. 

In contrast to the existence theory, for regularity results we do not need 
the compactness of the ambient manifold N. It suffices to have a uniform 
control on the geometry of N : 

Definition 8.5.6 We say that a Riemannian manifold N is of bounded ge- 
ometry if 

1) z(iV) := inf i{jp) > 0, where i denotes the injectivity radius 

peN 

and 

2) A := sup \K\ < oo, where K denotes the sectional curvature. 

N 

Theorem 8.5.2 A weak minimal surface H £ N) (E a Riemann 

surface) in a Riemannian manifold N of bounded geometry is continuous. 

Proof. We shall translate the argument of the above Proposition from the 
Euclidean case into a Riemannian context. Thus, the strategy of proof will 
be the same as before. 

Again, it suffices to treat the case E = D, h € H 1 ' 2 (D, N), and to prove 
continuity at 0. 

We let 

1 7 r 

0 < P0< - i(N)), 

0 < r < 1 

z 0 G D r = {\z\ < r},p := h(z 0 ) 

We assume that for almost all z £ dD r = {\z\ = r} 

d(h(z),p) > p 


(8.5.27) 
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with 


0 < p < po, 

where d(-,-) denotes the distance function of the metric of N. As before, we 
let r\ G C 00 ^) satisfy 

77(f) e 1 for t < \ 

77(f) = 0 for t > 1 

77 '(f) < 0 for all t, 

and again, we later on let 77 increase to the characteristic function X(-oo,i)- 
We now choose as test vector 

<P(z) := -i) P ) g T h(z) N. 

p is bounded, of class H 1 ’ 2 (namely 

J {dp, dp) < const. J { dh,dh ) <00, 

for example by (8.5.30) below, or directly from the chain rule), and by 
(8.5.27), it has compact support in D r . Therefore, p is an admissible test 
vector, and by (8.5.25) 

J{dh,dp) = 0. (8.5.28) 

E 

In order to evaluate (8.5.28), we compute 

{dp, dh) = (V a pdx + W _a_pdy, h x dx + h v dy ) 

dx dy y 

= 77( d( ~^^ )((VjL(-exp l 1 p),h x ) 

+ (V^(-exp ^ P ),h v )) (8.5.29) 

+ ((-exp k 1 p),h y ) 2 ) 

(cf. (4.6.6)). 

We have to estimate the covariant derivatives of (—exp j^p). 

For this purpose, let h{s) be a smooth curve in N. In order to control 
V _a_ exp^j p, we consider the family of geodesics 

c(f , s) := ex P/t(s) (f exp-^ p) 

d 

— c(f,s)| t=0 = exp Ms) p 


Then 
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and thus 


g 

V JL ex P^ s )P = V JL^c(t, S )| t=0 

„ d , , 

= V &fo c(t ’ s) r*=° ■ 

For fixed s, J s (t) := Jjc(f, s) is a Jacobi field along the geodesic c(-, s) with 

6>/i , 


J s (0) = /i'(s) (:=— ) 
J,( 1) = 0 G T P 1V, 


9 


Js (°)( := ^ J «(°)) = v £ ex Ph( s )P 

From Corollary 4.5.1, we have the Jacobi field estimate 


|J S (0) + j a (0)|| < -Ad 2 (h(s),p)\\J s (0)l 


hence 


II exp ft( 1 s) p+ h'{s)\\ < -Ad 2 (h{s),p)\\h'{s)\\. (8.5.30) 


We shall use (8.5.30) to compare V a_ exp ?] p with h x . 

The conformality relations 

{h x , h x ) = (h y , h y ), ( h x ,h y ) = 0 almost everywhere 

imply 

(exp l 1 p,h x ) 2 + (exp- 1 ^,^) 2 < *( \\h x \\ 2 + ||/i y || 2 )|| exp-^f 


= 2 l|d/ l || 2 -d 2 (/t,p) 


(8.5.31) 


The factor \ will be crucial. 
We define 


Mp) -=\\ \\dh\\ 2 v {^~). 

D r 

Then, because of (8.5.27) and p <p 

A » = ~2 ^ / Wdh\\ 2 d(h,pW(^-). 
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From (8.5.29), we get, since rf < 0, 77 > 0, 

(d(fi,dh) > r)( d ( h,P ^ )((V ^(-exp" 1 h),h x ) 

P dx ^ 

+ (V^(— exp” 1 h),h y )) 

by (8.5.31) 

P 2 p 

> + ,. ( f& ^)<ShA Wh f 

P P 2p 

- ^n( d ^ h,P ^ )d?(h,p)\\dh\\ 2 (by (8.5.30)) 

2 p 

and then from (8.5.28) 


2 A v (p) - pA' v (p) < Ap 2 A 11 (p). (8.5.32) 


This implies 


hence 


/ A v 




y> 0 , 


MP}l e i P l < 

2ttp? 


A M r 4 p% 

2ttp 2 


(8.5.33) 


whenever 

0 < Pi < P 2 < P- 

We again let rj approach the characteristic function X(_ OCj i) and obtain with 
A(p) := Area ( h(D r ) (~l B(p, p)) 


the following monotonicity formula 


ypf ) e 4p? 

2 kp \ 


< 


a ^ c 4pI 

2 tt p\ 


(8.5.34) 


whenever 0 < pi < P 2 < p. 

Again, if pi —> 0, the left hand side of (8.5.34) tends to the density of 
the minimal surface h{D r ) at p = h(zo), and this density again is a positive 
integer. 

Therefore, choosing p 2 = ~p in (8.5.34) 


P 2 < 



1 A „2 

< — e 2P ° 
27T 



D r 


since p < po- 


(8.5.35) 
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This is impossible, if r < rg and rg is chosen so small that 


f \\dh\\ 2 < 27re ^ p ° p 2 . 

Therefore, for such r, (8.5.27) cannot hold. Thus, for 0 < r < rg 


(8.5.36) 


essinf d(hfz), h(zg)) < p. (8.5.37) 

zedD r 

Also, by the intermediate value theorem, we can find r with \rg < r < rg 
and 

d(h(z), h(z')) < — — — - T ( f ||d/i || 2 )2 for all z, z' G dD r (8.5.38) 
(log 2) 2 J 

V ro 

(this is an alternative to the use of the Courant-Lebesgue lemma 8.3.5, the 
proof is similar). 

We then choose r 0 so small that in addition to (8.5.36) 

/ W dh W 2 <7jjr^ 2 - (8-5.39) 

Dr o 

For z 0 ,Zg G D r , |ro < r < r 0 , r satisfying (8.5.28), we find z,z' G dD r 
for which the infimum is attained in (8.5.37) for zg and z'g, resp. Then from 
(8.5.37) and (8.5.38) and the triangle inequality 

d(h(z 0 ),h(z' 0 )) < 3 p. 

Since this holds for all zo,z' 0 G D r , where r is estimated in terms of ~p 1 
continuity at 0 follows. □ 


Perspectives. In Theorem 8.5.2, we have shown that weakly harmonic and con- 
formal maps of finite energy from a Riemann surface into a Riemannian manifold 
(of bounded geometry) are continuous. The conformality of the map is not needed 
for this regularity result as was shown by Helein[114]. A systematic treatment is 
given in [115]. The removability of isolated singularities of weakly harmonic maps 
was already obtained by Sacks and Uhlenbeck[210]. The proof of the continuity of 
weak minimal surface given here partly uses some arguments of Griiter[105]. 
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8.6 Higher Regularity 

In this paragraph, we study continuous solutions / € iJ 1,2 (l7,]R"), 17 open 
in R m , of a system 

J a a ^(x)D a f l (x)Dp<p l (x)dx = J G l (x, f(x), Df(x))ip l (x)dx (8.6.1) 

Q O 

for all ip G H]' 2 n L°°(S 7,R n ). 

We shall assume the following structure conditions: 

(a Q/3 (a;)) ai( g = i i ... im is symmetric for almost all x, the coefficients 

a. a P{x) are measurable 

a a/3 {x)£a£f} > A|£| 2 for all £ = (£i, . . . , £ m ) G R m and almost all x G 17 

(711) 

with a constant A > 0. 

\a a/3 (x)\ < K for almost all x G 17 (.42) 


with a constant K. 

G(x, f,p) = (G 1 , . . . , G n ) is measurable in x and continuous in / and p. 
(This implies that G{x, f(x), Df(x)) is measurable in x for / G Hj oc .) 

\G(x, f,p)\<co + ci|p| 2 for all (x,f,p) G 17 x R” x (Gl) 

with constants Co,Ci- 

Later on, a a & and G ? will be assumed even differentiable, and so we may 
as well assume here that they are continuous instead of just measurable. 

If / is a continuous weakly harmonic map, then continuity allows us to 
localize the situation not only on the domain, but also in the image, i.e. to 
write everything down in fixed local coordinates. The preceding structural 
conditions then are satisfied, cf. lemma 8.6.2. 

Some notational conventions: 

We usually omit the indices in the image; thus e.g. 

D a f ■ Dfjtp := D a f l Dptp 1 with the standard summation convention. 

(Usually, also the dot will be omitted.) Also, we shall always integrate 
w.r.t. to the Euclidean volume element dx on i7, and this will often be omit- 
ted. 

We start with the following auxiliary result 

Lemma 8.6.1 Suppose f G G° D 77 1,2 (17, R") solves (8.6.1), where the coef- 
ficients satisfy (Al), (A2), (Gl). 

Then for every e > 0, there exists p > 0, depending on e, m, the structural 
constants X, I\, cq, C\, and on the modulus of continuity of f, with 
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J \Df\ 2 r/ 2 {x)dx < e J \Dij\ 2 dx 

B(xi,p) B(xi,p) 

whenever B(xi, p) C fl and p £ Hq’ 2 (B(xi, p), R). 


( 8 . 6 . 2 ) 


Proof. We choose 

<p(x) ■= (f(x) - f(x i))v 2 (x) 

in (8.6.1). We obtain 


f a al3 (x)D a fDpfr] 2 <c 2 sup \f(x) - /(a?i)| f \Df\ 2 rj 
J x€lB(x\,p) J 

B(xi,p) 

+ c 3 sup \f(x) — f(x\)\ j rj 2 
x£B(xi,p) J 


B{x\,p) 


+ 2 J a a ^(x)D ot fD 0 p{f(x)-f(x 1 ))p 

B(xi,p) 

because of (Gl) 

< c 2 sup | / (x) — f(xi)\ J \Df\ 2 p 2 

B(xi,p ) 

+ c 4 sup | f(x) - /(xi)| p 2 j \Dp\ 2 

B(xi,p) 

+ \ J a af) {x)D a fDpfp 2 

B(xi ,p) 

+ 8 sup | f(x) - f( Xl )\ 2 Ja^x)D aV D 0 r, 


where we have used the Poincare inequality (Corollary A. 1.1) for the second 
term. The claim follows with (Al), (A2) because we can make 
sup | f(x) — fix i)| arbitrarily small by choosing p sufficiently small, since 

B(xi,p) 

f is continuous. □ 


In order to proceed, we have to make additional structural assumptions 
about the system (8.6.1): 

The coefficients a a ^(x) are differentiable and 

|D 7 a Q/3 (a:)| < A'i for all a, (3, 7 = 1, . . . , m, x £ f2 (A3) 

with a constant K\. 

G = (G 1 , . . . , G") is differentiable with 
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\D x G(x,f,p)\ < 7 o + 7 i \p\ 3 (G2) 

\DfG(x, f,p)\ < 72 + 73b| 2 
\DpG(x,f,p)\ < 74 + 7r|p| 

In order to show the main idea of the subsequent regularity argument, we 
shall first derive a so-called a priori estimate. This means that assuming that 
we already have a regular solution, we can estimate its norms. 

Lemma 8.6.2 Suppose 

f e C° n h 1a n h 3 ’ 2 (B(x 0} R),R n ) 

is a solution of (8.6.1) with 17 = B(xo,R ), where the structural conditions 
(Al), (A2), (AS), (Gl), (G2) are satisfied. 

Then 

\\ D2 f\\L^B( Xo ,f)) + \\D.f\\l HB{xoS)) < CoR? +C 1 \\Df\\ L2(BM) (8.6.3) 

where Co and C\ depend on the structural constants in (Al) - (G2), on m, 
and the modulus of continuity of f. 

Proof. Since / € iJ 2,2 , for tp € Hq 2 

J a a0 D a fDpp = - J Dp(a a P D a f)ip. (8.6.4) 

We now put 

¥> = D^D.f) 

with ( G L°° fl hl' 2 (B(x 0 , R), K) to be determined later on. 

From (8.6.1), (8.6.4) 

J D 1 {a^D a f)Dp(eD 1 f) 

B(x 0 ,R) 

= - j a a ?D a f ■ Dp{D 1 (f 2 D 1 f)) 

B Go.R) , , 

, ( 8 - 6 - 5 ) 

. - j G(a;,/,77/)77 7 (e 2 D 7 /) 

B(xo,R ) 

= J D 1 (G(x, /, Df))D 1 f ■ £ 2 . 


B(xo,R ) 
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Now 

D 7 (a^D a f)D 0 (eD 7 f) 

= a a(i D 1 D a f ■ DfiD 1 f ■ e + a a P D 7 D a f ■ TXJ ■ D^ 2 (8.6.6) 

+ D 7 a a P ■ D a f ■ DpD 7 f ■ e + TXa al3 ■ D a f ■ D 7 f ■ D^ 2 

and from (G2) 


\D 7 G(x,f,Df)\\D 7 f\ <c 5 \Df\+c e \Df\ 4 + c 7 \Df\ ■ \D 2 f\ 

+ c 8 \Df\ 2 \D 2 f\ (8.6.7) 


and from (Al) 

\D 2 f\ 2 < ja a0 D 7 D a f ■ D^Dpf. (8.6.8) 

From (8.6.5) - (8.6.8) we conclude, using also (A2), (A3) 


J \D 2 f\ 2 -e<C 9 J \D 2 f\\DmDt\+c w J \D 2 f\\Df\e 

B(xo,R) B{xq,R ) B(xo,R) 

+ C11 J \Df\ 2 \£Dt\+c 5 j e 

B(xq,R) B(xo,R ) 

+c 6 J \Df\ 4 e+c 8 J \D 2 f\\D/\ 2 e 

B(xo,R) B(xo,R ) 

< £ic 9 j \D 2 f\ 2 e + ^ j \Df\ 2 \Dtf 

B(xo,R) B(xo,R) 

+ C2CW J |C 2 /| 2 f + g / \Dl\ 2 e 

B(xq,R ) B(xo,R ) 

+ c -f j \D.f\ 2 m 2 + c -f j m 2 e 

B(x 0 ,R) B(x 0 ,R ) 

+c 5 J e+c 6 J \Df\r 

B(xo,R ) B(xo,R) 

+ £3C 8 J |A> 2 /| 2 ? 2 

B(x 0 ,R) 

+ S / |B/|4?2 (8 - 6 - 9) 

B(xq,R) 

with arbitrary positive e 1 X 2 X 3 , where we have used the inequality 
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ab < ear + — b 2 for arbitrary £ > 0, a, b £ M . 
4£ 

We may choose 61 , 62,63 > 0 so small that 


( 8 . 6 . 10 ) 


£lCg + £2 Ciq + £3Cs < - 


and obtain 


J \D 2 n 2 e<ci 2 J i^/i 2 i^i 2 

B(x 0 ,R) 

+C13 J e+ci 4 J m*e (8.6.11) 


B(x 0 ,R) 


B(x 0 ,R) 


B(x 0 ,R) 


For £ > 0, we now choose p > 0 as in lemma 8.6.1. 

We assume 

B(xi,p) C B(x 0 ,R) 

and choose £ € Cg°(B(xi,p)) with 0 < £ <1, 

£ = 1 on B(xi, ^) 

1^1 < - 

P 

Thus, all preceding integrals need to be evaluated only on B(xi,p). We now 
write 

j \D.f\ 4 e= j \Df\ 2 (\Df\ 2 -e ) 

B(xi,p) B(xi,p) 

and apply Lemma 8.6.1 with p = \Df\ ■ £ and obtain 


I \Df\ 4 e<e j |I>(|£>/|£ 2 )| 2 (8.6.12) 

B{x u p) 

J \D 2 /\ 2 e+e J iwi 2 i^i 2 


B(xi,p) 


B(xi,p) 

We may choose e > 0 so small that 


B(xi,p) 


ec 14 < - • 


(8.6.11) and (8.6.12) give 
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J \D 2 f\ 2 <c 15 J e + cw J \Df\ 2 \D(\ 2 

B(x i,f) B(xi,p) B(xi,p) 

<ci 7 p m +^f J \Df\ 2 . (8.6.13) 

B(xi,p) 

Covering B( x 0 , R ) by balls B(x\, |) with B{x\,p) C B(x 0 , R ), we obtain the 
desired estimate for 

J \D 2 f\ 2 . 

B(x 0 ,f) 

(8.6.12) and (8.6.13) and the same covering argument then also yield the 
estimate for 

J \Df \ 4 . □ 

B(x 0 ,f ) 

However, we cannot apply Lemma 8.6.2 because we do not know yet that 
/ £ H 3 ’ 2 . The point, however, is that the conclusion does not depend on 
the H 3 ’ 2 -norm, and a slight modification will give us the desired regularity 
result: 

Lemma 8.6.3 Suppose that 

/€C ,0 ni? 1 ’ 2 (H(a;o,fi),M n ) 

is a solution of (8.6.1) with fl = B(x o, R) and the structural conditions (A1 ), 
(A2), (A3), (Gl), (G2). Then 

/Gi7 2 ’ 2 nH 1 - 4 (H(cr 0 ,|),lR"), 

and the same estimate as in Lemma 8.6.2 holds. 

Proof. We just replace certain weak derivatives by difference quotients (cf. 
(A. 2.1)) in the proof of Lemma 8.6.2. Namely, we put 

with ( as above. 

Analogously to (8.6.11), we get with A h = (Af , . . . , 

J \o(A h f)\ 2 e 

B(x 0 ,R) 

<ci 2 J \A h .f\ 2 \D(\ 2 (8.6.14) 

B{xq,R ) 

+ci3 J e+c u j \Df\ 2 \A h n 2 e . 

B(xq,R) B(xq,R) 
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But 

J \A()f\ 2 \Df\ 2 < J \Df\ 2 \D£\ 2 

(this is similar to Lemma A. 2.1). 

Using Lemma 8.6.1, we then obtain analogously to (8.6.13) 

J \D(A h f)\ 2 e<c 17 p n + ^ j \Df\ 2 . (8.6.15) 

B( xi,|) B(xi,p ) 

Lemma A. 2. 2 then shows that the weak derivative D 2 f exists and satisfies 
the same estimate. Likewise, we get control over the id-norm of Df. □ 

Lemma 8.6.4 Let f € C° CiH 1 ’ 2 (B(xo, R),R n ) be a solution of (8.6.1), with 
structural conditions (Al), (A2), (A3), (Gl), (G2) satisfied. Then 

R 

Df £ L P (B( xo, — )) for every p < oo, 

and 

j \Df\ p \D 2 f\ 2 < oo. (8.6.16) 

B(x o,f ) 

Remark. As in Lemma 8.6.2, one also gets a-priori estimates, with constants 
depending also on p. 

Proof. By Lemma 8.6.3, we know already 

Df £H^ 2 CM\B(x 0 ,|)). 

We put 

w := \Df\ 2 . 

We are going to show by induction that for every s £ N, s > 2, and R\ < f) 
J (w s + w s ~ 2 \D 2 f\ 2 ) < oo. (E s ) 

B(xo.Ri) 

By Lemma 8.6.3, (E 2 ) holds. We assume (E s ) and want to conclude (E s+ i). 
We put 

Wl(x) := min(w(x), L) for L > 0. 


We observe 


Dwl(x) = 0 if w(x) > L 
\Dw\ < 2|D 2 /M 

\Dw l \ < 2\D 2 f\wl from (8.6.17), (8.6.18). 


(8.6.17) 

(8.6.18) 
(8.6.19) 
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Let 77 G L°° fl Hq , ‘ 2 (B(xq^ Ri)). We compute, for x\ G B(x 0 ,Ri) 


2 s 

rj w wl = 


rfDf • Dfw s 1 wl by def. of w 


B(xq,Ri 


B(xq,Ri ) 


B(xq,Ri ) 


= J v 2 D(f-f(x 1 ))-Dfw s - 1 w L (8.6.20) 

B(xo,Ri) 

<( 2 m+l) sup \f(x)-f(x 1 )\ f if\D 2 f\w s ~ 1 w L 
x£B(xo,Ri)r)suppr] J 

B(xo,Ri ) 

+ 2 sup \f(x)-f(x 1 )\ / 'qDrjw^WL, 

x£B(xo,Ri)nsuppri J 


B(xo,Ri) 

integrating by parts and using ( 8 . 6 . 18 ), ( 8 . 6 . 19 ). 


We now write 


rj 2 \D 2 f\w s 1 wl = (rj\D 2 f\w 2 wl)(ijw 2 wl) 


2s — 1 s I s — 2 I 

i^Drjw 2 wl = ( r]w 2 wl){Dr]w 2 w£) 


and obtain from (8.6.20) (with 2 ab < a 2 + b 2 ) 


[ T] 2 W S Wl< sup \f{ x ) — f{ x l )\{ c 19 f r] 2 \D 2 f\ 2 w s 2 w l 

J xeB(xo,Ri)r\ suppr] ^ J 

B(xo,Ri ) B(xo,Ri) 

+ c 20 J 7 j 2 w s w L c 2 i J \D 7 ]\ 2 w s ~ 1 wl'^ ■ (8.6.21) 


B(xo,Ri) 


B(xq,Ri) 


Here, the constants C19 and c 2 o also depend on s. 

Since / is continuous, given £ > 0 there then exists R(s) with the property 
that for 0 < i? 2 < R(e) and T] € Hq’ 2 (B(xi, i? 2 )) (H(xi,l? 2 ) C S(xo,i?i)) 

J 7 ^ 2 W S Wl < £ J T] 2 \D 2 f\ 2 W S ~ 2 WL + £ J \D 7 i\ 2 W S ~ 1 Wl- 


B(xi,R2) 


B(aji,i?2) 


B(xi,R2) 


We now require for ry £ H^’ (B(x\, i? 2 )) 


(8.6.22) 


V= 1 on E(a;i, : y), 

0 < ?y < 1, 

l-Dr/l < L. 

R2 

Since / G W 2,2 by Lemma 8.6.3, the equation (8.6.1) yields for 0 G iLg 
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J D 7 (a al 3 D a f)D^ = - J a a/ 3 D a fD 7 D^ 

= - J G(x,f,Df)D^ (8.6.23) 
= J D 1 G{x, f , Df)i> 

The resulting equation 

j D^D a f)D^ = J D 7 G(x, f , Df)ip (8.6.24) 

then also holds for ip £ H q’ 2 (instead of H q’ 2 ), because we can approxi- 
mate ip £ Hq' 2 by // ( j’ 2 -functions (actually, even by Cg 0 -functions), and an 
easy application of Lebesgue’s theorem on dominated convergence allows the 
passage to the limit. 

We apply (8.6.24) to 

ip := r] 2 w S M 2 w L D 7 f 

and obtain 

J D 1 {a a0 D ol f){Dj 3 {r?w S M 2 w L D 1 f)) 

B( Xl ,R 2 ) 

= I D 7 (G(x,f,Df))n i w£ 2 w L D^f (8.6.25) 

B(xi,R 2 ) 

and from this equation and the structural conditions (as in the derivation of 
(8.6.9)) 

J o af) D 1 D a f ■ D 0 (rf J w S M 2 w L D 1 f) 

B(xi,R 2 ) 



B(xi,R 2 ) 


+ C23 J 7]Dr]ww s M 2 w L (8.6.26) 

B(xi,R 2 ) 

+ C24 J 2 W L + C 25 J lfw 2 W S M 2 W L 

B(x±,R 2 ) B(xi,R 2 ) 

+ C26 J V 2 \ D 2 f\wW S M 2 W L 

B(xi,R 2 ) 

where C 22 again depends on s. 
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Now 

J if \D 2 f\wwlJ 2 w L < J rj 2 \D 2 f\w s ~ 2 w L 

<6 J r] 2 \D 2 f\ 2 w s ~ 2 w L + r] 2 w s w L , (8.6.27) 

and this is bounded because of (E s ) and since wl is bounded. 

Likewise 

J V 2 \ D 2 f\w?w S M 2 w L 

<6 J r] 2 \D 2 \ 2 w s ~ 2 w L + ^ J r] 2 w a ~ 1 w L (8.6.28) 

Therefore, all terms on the right hand side of (8.6.26) remain bounded as 
M — » oo. The same then has to happen for the left hand side of (8.6.26). We 
may hence replace wm by w in (8.6.26), and conclude that 

J a a0 D a D^f ■ Dp(rj 2 w s - 2 w L D 1 f) < oo. (8.6.29) 

B(x lt R 2 ) 


But this expression equals 

j a a0 D a D 7 f ■ DpD 1 fw s ~ 2 w L r] 2 
+ (m — 2) J a a0 D a D 1 fDpww s - 3 WLD 1 fri 2 
+ J a 010 D a D 1 fD/ 3 WLW s ~ 2 D 1 fr] 2 

+ 2 J a a0 D a D^f ■ r]D l3 r]w s ~ 2 w L D^f 


(8.6.30) 


Since Dpw = DpDsf ■ Dsf and Dwl = 0 for w > Wl, we can rewrite the 
second and the third integral in (8.6.30) as 

m [ aCtl3 D a wDpww s - 3 WLri 2 + ^ f a a0 D a WLDpWLW s ~ 2 r] 2 > 0. 

(8.6.31) 

The fourth integral in (8.6.30) is estimated by 

S j \D 2 f\ 2 w s ~ 2 w L r] 2 + -^r- J w s ~ 1 w L \Dr]\ 2 (8.6.32) 


05 > 0 ). 

Choosing 6 > 0 small enough in (8.6.27), (8.6.28), (8.6.32), we obtain 
from (8.6.26) - (8.6.32) (recalling 0 < r) < 1, \Drj\ < ^) 
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J \D 2 f\ 2 rj 2 w s 2 w l < 

B(xi,R,2) 


^ J a al3 D a D 1 fDpD 1 fw s 2 w L g 2 

B(xi,R.2) 


< C28 


W WlT]‘ 


B(x\,R2) 


+ C2 g(l + — 2 ) 

41-2 


S(a:i ,-R2) 


(8.6.33) 


We then choose £ > 0 in (8.6.22) small enough (and thus determine R(s)) to 
obtain from (8.6.22) and (8.6.33) 

J (\D 2 f\ 2 w s ~ 2 w L + w s w L ) < c 30 (l + -^) J iv s . (8.6.34) 

B( 2 B ( x 1 . 4 * 2 ) 

We may then let L — » 00 in (8.6.34). 

A covering argument then gives for every Ri < Rq 

J (w s+1 + w s ~ 1 \D 2 f\ 2 ) < 00 . (£«+i) 

B(x 0 ,R!) 

This concludes the induction. □ 


We obtain 

Lemma 8.6.5 Let f G C° ft H[° 2 (f2, K") be a solution of (8.6.1), with struc- 
tural conditions (Al), (A2), (A3), (Gl), (G2) satisfied, and furthermore 
G C 2 {(2) for alla,/3. 

Then 

f G 


Proof. From (G2), 


| — G(x, / ( 2 ), Df(x)) | = | G x + GfDf + G P D 2 f\ 

< k 0 + h\Df \ 3 + k 2 \D 2 f\ + k 3 \Df\\D 2 f\, (8.6.35) 

and this is in L 2 by Lemma 8.6.4, 8.6.3. 

Consequently, / is a weak solution of an equation 

D p {a aP (x)D a f) = g(x) (8.6.36) 

with g G H 1 ’ 2 . 

The claim follows from Theorem A. 2.1. □ 


We can now prove 

Theorem 8.6.1 A continuous weakly harmonic map f : M — > N between 
Riemannian manifolds is smooth. 
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Proof. As explained before, by continuity, we may localize in domain and im- 
age, and we thus treat a continuous weakly harmonic map as a weak solution 
of the elliptic system 

= -Vry at 3 ri k (f(x))D a fD 0 f k =: k(x). (8.6.37) 

The structural conditions (Al) - (G2) then are satisfied. 

Lemma 8.6.5 implies 

/ e H loc- 

Now 

\D 2 (^r 0 r; k (f)D a fD 0 f k )\ 

< Ko\Df\ 2 + Kl \Df\ 4 + K 2 \D 2 f\\Df \ 2 + K 3 \D 2 f\ 2 + K 4 \Df\\D 3 f\. 


If m := dim M < 3 then Sobolev’s embedding theorem (Theorem A. 1.7) 
already implies that this is in L 2 oc . Hence, the right hand side k of (8.6.37) 
is in H 2 ° 2 and by Theorem A. 2.1, 


In this manner, inductively 

/ G H% => k G H "- 1 ' 2 => / G HI+ 1 ’ 2 , (8.6.38) 

and Corollary A. 1.2 implies / G C°°. 

If m — dim M is arbitrary, one either can apply more refined elliptic 
regularity results, or alternatively observe that Df satisfies a system with 
similar (actually, even better) structural conditions, and so the preceding 
results may be applied to Df instead of /. Iteratively, the same is true for 
higher derivatives of /, and thus one gets again 

D v f e nl 2 c 

for all ry i.e. 

/ e h £ 

for all £, hence f G C°° by Corollary A. 1.2. □ 


Perspectives. The regularity results and proofs of this paragraph are due La- 
dyzhenskaya and Ural’ceva[165] although this is usually not acknowledged in the 
western literature on harmonic maps. Their proof has been adapted to harmonic 
maps into spheres in [26]. 
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8.7 Formulae for Harmonic Maps. The Bochner 
Technique 


A) We first want to derive the formula for the second variation of energy. 
For this purpose, let 

fst(x) = f(x,s,t ) 

/ : M x (—£,£) x (—£,£) — > N 


be a smooth family of maps between Riemannian manifolds of finite energy. 
M (but not N) may have nonempty boundary, in which case we require 
f(x, s, t ) = f(x, 0, 0) for all x € DM and all s, t. 

We put 


V := 


dfst 

ds | s =t=o’ 


W := 


dfst 

dt | s =t=o 


We want to compute 


d 2 E{f st ) 
dsdt | s =t=o 


To simplify notation, we usually write / instead of f s t , and also 


df = P~dx° 

J dx a 


df_ 

dx a 


dx a <8) 


d 

W v 


a section of T*M®/ 1 TN. 

Then ^ 

>m E ^ = y 

M 

We compute the integrand: V will denote the Levi-Civita connection in 
f~ 1 TN, and everything will be evaluated at s = t = 0 : 
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a , 0\ 

dtds 2W ’dxf* /T*M®f~ 1 TN 

— lm a ^L dx a ^Lcix?) 

— \ v #: “ x > n„a ux /T*M®f~ l TN 


df 


dt 9a dx c 


dx0 

since V is metric 


- !( V a ( ^-)dx a dx P) 

~ dV ' F>v& /T* M®f~ 1 TN 


'ds ’&e0 

since V is torsionfree 


df df 

Q x 0 dxl3 '>T*M®f~iTN 

+ (^gfn -(-Q s )dx a ,\7 _^Lp{-Q^)dx l3 ) T , M ®f- 1 TN 

df df 

= ^ V af^ V ’ Q x 0 dxt3 '>T*M®f-iTN 

+ (R n (— dx a 9f dx?) 

+ { ( dt’dx a, ds ' dx? / T* M®f~ 1 TN 

+ (y g§G-V dxOL 1 ^ -^pWdx /3 ) T *M®f- 1 TN 

by definition of the curvature tensor i? w of AT 

df 

m (~ds^’ d f'} T *M®f~ 1 TN 
- tr&ce M {R N (df,V)W,df) f - 1TN 
+ trace M (W, VFF) f-i TN - 


Thus 

d 2 E(f st ) 
dsdt |*=t=o 

= J (VV ,vw) f - 1TN 

M 

- J trac e M (R N (df, V)W, df) f -i TN (8.7.1) 

M 

+ J (VV,(-),^ w _i T , . 

M 
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We want to examine the third term in (8.7.1) more closely: 




T* M®f~ 1 TN 


j ds dx<X ' ^ cte/ 3 )r*M®/- 1 TN 

M 

since V is metric and integrating by parts 


f 9f 

= - (V_a — , trac e M Vdf) f - 1TN . 


(8.7.2) 


Theorem 8.7.1 For a smooth family f st : M — » TV of finite energy maps 
between Riemannian manifolds, with f s t{x) = foo( x ) f or oM x G dM (in 
case dM ^ 0) and all s,t, we have for the second variation of energy, with 
y — w — ®L 

V ds | s =0’ VV ~ dt |t=0 

d 2 E(f st ) 
dsdt |s=i=o 

= j (VV, VW) f ~i TN — j tra ce M (R N (df,V)W,df) f . 1TN 

M M 

+ J (V_a trac e M Vdf)f-i TN . (8.7.3) 

M 

If foo ^ harmonic, or if V = 0 for s = t = 0, then the second variation 
depends only on V and W, but not on higher derivatives of f w.r.t. s,t , and 

If{Y ' W) ^ d ~§t 1 = / < W >' VW h~'TN 

M 

-J trac e M (R N (df,V)W,df) f _ 1TN . (8.7.4) 


Proof. (8.7.3) follows from (8.7.1), (8.7.2). (8.7.4) holds if either V_a = 0 

dt OS 

or trace m V df = 0, and the latter is the harmonic map equation (cf. (8.1.14)). 

□ 

We look at the special case where we only have one parameter: 

f(x,t) = ft( x ),f ■ M x (-£,£) -> N, 

W := — 

dt \t=o 
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Corollary 8.7.1 Under the assumptions of Theorem 8.7.1, 


If(W,W)=—E(f t ) lt=0 


= I l|V^||2_ lrjv 

M 

- j trac e M (R N (df,W)W,df) f _ 1TN , 


(8.7.5) 


if f is harmonic or if f(x, •) is geodesic for every x. 

Proof. If f(x,-) is geodesic for every x, 

Va_^f = 0. 

e* dt 

All assertions follow from Theorem 8.7.1. □ 

Remark. For geodesics, the second variation of energy was already derived 
in Theorem 4.1.1. 


Corollary 8.7.2 Under the assumptions of Theorem 8.7.1, if N has non- 
positive sectional curvature, then a harmonic map is a stable critical point 
of the energy functional in the sense that the second variation of energy is 
nonnegative. 

Proof. If N has nonpositive sectional curvature, 

<• R N (df($),W(x))W(x),df ($ )) < 0 

for every x £ M, T> £ T X M , and every section of W of f~ 1 TN 1 and the claim 
follows from (8.7.5). □ 

B) We next want to calculate 

Mf) 

= A \t aP { x )9i j {f{x))f % x o.f 3 xP 

for a harmonic map / : M — > N. (Here /*<* := §£z). The computation may 
be carried out in the same manner as at the end of 3.3 and in 3.5. It is 
somewhat easier, however, to perform it in local coordinates. 

In order to simplify the computation, we introduce normal coordinates at 
x and at f(x). Thus 
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7a/3 9 ij(f (x')') ^ij > (8.7.6) 

and 

7 a0,s( x ) = 0, gij,k{f(.x )) = 0 (8.7.7) 

for all indices. Therefore, in our computations, we only have to take second 
derivatives of the metric into account; these will yield curvature terms. 

We rewrite the harmonic map equation (8.1.7) as 

o = 7 a0 {x)fU x e - l a0 (x)r^(x)f xV +^(x)r; k (f(x))f^f k x ,. (8.7.8) 

(Here, the Christoffel symbols of M have Greek indices, those of N Latin 
ones.) 

We differentiate (8.7.8) at x w.r.t. x e and obtain, recalling (8.7.6), (8.7.7) 

fx a x a x s 2 (7ar/,a£ + 7 ar),ote — "faa,r)e) f ? x >? 

- 9ki,lm + gU,km - 9kt,im)f™fx<*fi<*. (8.7.9) 

Moreover, by (8.7.6), (8.7.7) 

7 a/3 , £e = -7 a/3)££ (8.7.10) 

and from the chain rule 

= g ijM &&. (8.7.11) 


(8.7.9) - (8.7.11) yield 


= fx a x e fx a x e ~ 2^ a 9’ s£ ~k 7ee,a/3 — 7ea,e/3 ~ 7 e/3,eot) f x a f x P 
T 1^(9%], k£ + 9k£,ij ~ 9ik,jt. — 9j£,ik)f x a fx a fx e fx s 

ri ri , r>M ri xi tdN ri xj xk x£ 

Jx a x £ J x a x £ ' I *'af3Jx oc JxP - tr ^i£jkJx a Jx a Jx £ Jx £ 


(8.7.12) 


(cf. (3.3.15)), where 


R 


M _ 
a/3 


7 6e R 


M 

OiSfiE 


= R 


M 

otepe 


is the Ricci tensor of M, and R% jk is the curvature tensor of N. 

In invariant notation, if ei,...,e m is an orthonormal basis of T X M , 
(8.7.12) becomes 


-Ae(f)(x) =\\Vdf\\ 2 

+ (df(Ric M (e a )), df(e a )) f -i TN 
- ( R N (df(e a ), df(ep))df(ep),df(e a )) f - 1TN . 


(8.7.13) 
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Corollary 8.7.3 Let M be a compact Riemannian manifold with nonneg- 
ative Ricci curvature, N a Riemannian manifold with nonpositive sectional 
curvature. 

Let f : M — > N be harmonic. 

Then f is totally geodesic 7 (i.e. S7df = 0) and e(f) = const. If the Ricci 
curvature of M is (nonnegative, but) not identically zero, then f is constant. 

If the sectional curvature of N is negative, then f is either constant or 
maps M onto a closed geodesic. 

Proof. By Stokes’ theorem, 


J Mf) = o. 


Therefore, the integral of the right hand side of (8.7.13) also vanishes. Since 
the integrand is the sum of three terms which are all everywhere nonnegative 
by assumption, all three terms have to vanish identically. 

We first conclude 

IIW/II = 0, (8.7.14) 

hence Vd/ = 0 so that / is totally geodesic. 

Secondly, 

Mf) = o, 

and since harmonic functions on compact Riemannian manifolds are constant 
(cf. Corollary 2.1.2), 

e(/) = const. (8.7.15) 


If for some x £ M, 


R^fp(x) is positive definite, 


implies 


hence e(/) = 0, hence 


R«p{x)fUf X 0 = o 


df{x) = 0, 


e(f) = 0 


by (8.7.15), and / is constant. 

If N has negative sectional curvature, then 

(■ R N (df(e a ),df(e f3 ))df(e 0 ),df(e a )) = 0 

implies that df(e a ) and df(ep) are linearly dependent everywhere. Therefore, 
f(M) is at most one-dimensional. If the dimension is zero, / is constant, and 


7 


See Lemma 8.7.1 below 
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if the dimension is one, f(M) is a closed geodesic because / is totally geodesic 
and M is compact. 

(See Lemma 8.7.1 below) □ 

Remark. The method of proof of Corollary 8.7.3 is another instance of the so- 
called Bochner method which is very important in Riemannian and complex 
geometry. The prototype of the technique was already given in 3.5. 


Lemma 8.7.1 A smooth map f : M — > N between Riemannian manifolds is 
totally geodesic iff f maps every geodesic of M onto a geodesic of N. 


Proof. Let 7 (<) be a geodesic in M. Then 

dt at 9t at 

= C V T /df O 7 ) ( ) + d/C Vl M ^ ) 
dt at dt at 

d'y d'y 

= Vd/( — , — since 7 is geodesic. 

Thus (/ o 7 )(t) is geodesic iff Vdf(^, = 0. 

C) We finally want to derive and exploit a chain rule. If / : M 
and h : N — > Q are smooth maps between Riemannian manifolds, 

r(h o /) = trace \/d{h o /) 

7 ^dpd xP’ 

e dh dp dp 

= 7 


dfo df l ; dx a dx& 


v«/3^V 


dff 

df l dx@ 


□ 

N 


where Wdh is the Hessian of h (see Definition 3.3.5), and r(/) is the tension 
field of /. 

Thus 


Lemma 8.7.2 For smooth maps f : M —> N ,h : N — > Q between Rieman- 
nian manifolds, the following chain rule holds 

T (h o /) — 7 a/3 Vdh(^4, 77 + [dh) o (t(/)). (8.7.16) 


In particular, if f is harmonic 
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r(W)= 7 “' , Vc»>(JT,|i). (8.7.17) 

□ 

Remark. If Q = R, of course r = —A, where A is the Laplace Beltrami 
operator. 

Definition 8.7.1 g : N — » R. ( N a Riemannian manifold) is called subhar- 
monic if 

-Ag > 0. 

Corollary 8.7.4 If f : M —> N is harmonic, and h : N — + M is convex, then 
ho f is subharmonic, i. e. 

— A(h o /) > 0. 

Conversely, if f : M —> N is a smooth map such that for all open V C N 
and convex h : V — > R, with U := / -1 (C), 

ho f is subharmonic, 

then f is harmonic. 

Proof. (8.7.17) implies the first part. For the second part, if / is not harmonic, 
we may find some Xq £ M with 

T~(f)(x o) ^ 0. 

We then need to find a convex function h on some neighborhood V of f{x o) 
for which 

-A(ho f)(x 0 ) < 0. 

If N were Euclidean, we could simply take a linear function h, i.e. \7dh = 0, 
with (grad h)(f(x 0 )) = -r(/)(x 0 ). 

We then have 

-A(h o f)(x 0 ) = dho r(/)(x 0 ) 

= ((grad h)(f(x 0 )),T(f)(x 0 )) 

= -\\r(f)(x o )f<0. 

In the Riemannian case, in general, we may not find local functions with 
\7dh = 0, but if we consider sufficiently small neighborhoods V, we may find 
such functions ho for which ||Vd/io|| is arbitrarily small while we still have 
a prescribed gradient (grad ho)(f(x o)) = —r{f){x o). This follows from the 
definition of the Hessian X/dho , see (3.3.47), together with the fact that in 
Riemannian coordinates centered at f{xo),r l - k {f{xo)) = 0, see (1.4.12), and 
so P-f, can be made arbitrarily small in a sufficiently small neighborhood V 
of f(x o). 
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Still, ho is not convex, but since ||Vd/ioll is small, it can be made convex 
in a small neighborhood V of xq by adding a small multiple of d 2 (f(x o), •), 
the squared distance function from f(x o), using (4.6.6). Since that multiple 
is small, say e, the new function 

h = h 0 + ed 2 (f{ xq),-) 

can still be assumed to satisfy 

dh ° T (/)(® o) < 7 |^) 
i.e. 

-Ah o f( x 0 ) < 0. 

This completes the proof. □ 

Corollary 8.7.5 If f : M — > N is harmonic and if N has nonpositive sec- 
tional curvature, and is simply connected and complete, then for any p £ N 

-Ad 2 (f(x),p)>2\\df(x)\\ 2 

Proof. (8.7.17) and Lemma 4.8.2. □ 

Corollary 8.7.6 Let M be a compact Riemannian manifold, N a Rieman- 
nian manifold, f : M —> N harmonic. 

If there exists a strictly convex function h on f{M), then f is constant. 

Proof. By Corollary 8.7.3, ho f is subharmonic. The following Lemma shows 
that ho f then is constant. Since h is strictly convex, (8.7.17) implies / = 
const. □ 

Lemma 8.7.3 Let M be a compact Riemannian manifold. Then any subhar- 
monic function c p is constant. 

Proof. By Stokes’ theorem 

J Aip = 0 , 

M 

so that a subharmonic function is harmonic, hence const, by Corollary 2.1.2. 

□ 

Corollary 8.7.7 If f : M —> N is harmonic and h : N —> Q is totally 
geodesic, then ho f is harmonic. 


Proof. (8.7.17). 


□ 
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Perspectives. For more special domains, other Bochner type formulae for har- 
monic maps have been found. Here, we only want to quote two such formulae. 

Siu[225] derived the following formula that is actually valid for any smooth, not 
necessarily harmonic map between Kahler manifolds / : M — > N 

dd(g i jdf i A df j ) = R ljh ,i)f A Of A i)f A <>f< g^D'Bf A D"df j . 

Here, (gfj) is the Kahler metric of N in local holomorphic coordinates (f 1 , . . . , /”), 
Rfjkl its curvature tensor, F!j k its Christoffel symbols, 

D'Bf = dBf + r] k df j a Bf k 
D "df =Bdf j +rl k Bf a df k 

the covariant derivatives. 

The assumption that / is harmonic is needed if one wants to know the sign of 
the second term on the right hand side. Namely, in that case 

gijD'Bf A D"8f A /- 2 = qu> n 

for some nonpositive function q on M, where u> is the Kahler of M. Furthermore, 
if the curvature tensor is “strongly seminegative”, then the first term on the right 
hand side is a nonnegative multiple of u n , and integration by parts then gives as in 
the proof of Corollary 8.7.3 that under these conditions, a harmonic map / satisfies 

D'Bf = D"df = 0 . 


This means that / is pluriharmonic. 

If the curvature of N is even “strongly negative” and if the real rank of df 
is at least 3 at some point, then Siu showed that / has to be holomorphic or 
antiholomorphic. If N is a Riemann surface of negative curvature then the real 
dimension of the image is 2, hence rank^ df < 2 and Siu’s result does not apply. 
Nevertheless, in that case, Jost and Yau[146] showed that the level sets of / still 
define a holomorphic foliation of M although / itself need not be holomorphic. 

We now want to derive a Bochner type identity for harmonic maps from Ein- 
stein manifolds, due to Jost and Yau[148]. In order to simplify the formula and its 
derivation, we always use normal coordinates at the point under consideration and 
denote (covariant) derivatives by subscripts, e.g. 


The formula then is 


/< 


^’ V ^ V 


d 

dx& 


Theorem. Let f : M — > N be a harmonic map between Riemannian manifolds, 
where M is compact and Einstein. Then for any A £ K 

A J (fa/3, fa/3) +2 J Rap-ylifaS, //3-y) 

M M 

= -A j R™ p {f«,fp)- J R^ s R^ t {fa,f v ) 

M M 

+ A J (R N (f a ,fp)f 0 ,U) 

M 

+ J R^6(R N (f-y,fs)f 0 J a ). 

M 
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Let us give the proof: We start with (8.7.13), i.e. 

=^A(f a , f a ) = {{f a p, f ap ) + Rap{for, U) - (R N (U U)U, /«>} ■ 
We compute 

(V 7 V 5 -ViV 7 )(/ a da:“) 

=(—Rpaysfpdx a ) + R N (fy, fs)f a dx a . 

From (2), 

((V 7 V«s - V5V 7 )/ a dx“, (V 7 Vj - VaV 7 )/ / 3da: /3 ) 

=Rpay S R^s(fp, fr,) + (R N (fi, ff)fa, R?{fy, fi)fa) 

-2 R^s{R N (f^fs)fp,f a ). 

Denoting the L 2 -product on T*M <g) f*TN by (•, •), we get 


((V 7 V«s - X7 -,) f a dx a , (V 7 Vs - Vs'V^fpdx' 3 ) 

= {-Rp aiS fpdx a , (V 7 V« - V«V 7 )/„da: , ') 

+ ( R N (f t, f S )f a dx a ,(VyX7 s - X7 s X7^)fpdx 0 ) 

= 2(-R^sfdx a ,\/^sf v dx v ) + j {R N (f^,f S )f a ,R N (fy,fs)fa) 

M 

- J R^ s {R N (fy,fs)fp,fc,). 

M 

Integrating the first term by parts, we get 

= 2 j {^(R^sfp)Ja S ) + J (R N (f~n fs)fc R N U-y, fe)f a ) 

M M 

- J R«w(R N (f^fs)fpJc,). 

M 


Now 


(^(Rpa-ysMJcs) 

=Rpocyg{fpy, fas) + R[Ly6,y (ff3 , f ad) ■ 


The 2nd term vanishes for any Einstein metric since 

jyM / pM pM \ 

**'/3at'y5,'y \ jrt '5'yoi'y,(3 -^8^(3^ ,a ) 

by the Bianchi identity, and this vanishes if the Ricci tensor is parallel. 
We obtain for an Einstein metric on M 

((V 7 V« - VaV 7 )/ a da:“, (V 7 V« - V aV y)fpdx 0 ) 

S=-2 J R$ h s{faS,f(n)+ j (R N (fy, fs)fc,R N (fy, fs)fa) 

M M 


- J R^ys(R N (fy,fs)UJ a ). 

M 


( 1 ) 

(2) 

(3) 


( 4 ) 
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From (1), (3), (4) we get the desired formula. For the special case N = M, the 
formula is simpler and due to Matsushima[175]. 

For an application of the formula, see the Perspectives on 8.10. 

A general discussion of identities for harmonic maps and applications can be 
found in Xin[260]. 

The characterization of harmonic mappings given in Cor. 8.7.4, i.e. that a 
(smooth) map between Riemannian manifolds is harmonic if and only if locally 
the composition with all convex functions is subharmonic has been observed by 
Ishihara[124]. 

It might be tempting (and it has been proposed) to use that characterization for 
an axiomatic approach to harmonic maps. That, however, would loose the deeper 
aspects of harmonic maps based on their variational properties. It will be explained 
in the next § that a rather general and satisfactory theory can be developed for 
harmonic mappings with values in Riemannian manifolds of nonpositive sectional 
curvature based on an abstract variational approach. By way of contrast, a char- 
acterization analogous to Cor. 8.7.4 can also be obtained for solutions of other 
nonlinear elliptic systems for maps between manifolds that need not have a varia- 
tional origin. For example, Jost and Yau[147] considered the system 


3 ( d 2 f , df^df^ 

dz a dz 0 jk dz a dz 13 


for maps / : X — > N where N is a Riemannian manifold as before, but X is 
a Hermitian manifold with metric ( 7 a ^)a,/ 3 =i, ■■■,dim c m- The preceding system is 
equivalent to the harmonic map system if the metric (j a p) is a Kahler metric, but 
not for a general Hermitian metric, and in fact, in the general case, it need not arise 
from a variational integral. Analogous to Cor. 8.7.4, solutions can be characterized 
by the property that local compositions with convex functions h : V ( C N) — ► R. 
satisfy 

/S d 2 (hof) 

dz a dzP 


> 0 . 


However, as examples show, one does not always get the existence of solutions of the 
new system in a prescribed homotopy class of maps / : X — > N, N of nonpositive 
sectional curvature, as in the existence theory for harmonic maps. The reason for 
the failure of the existence theory is the lack of variational structure. We refer to 
[147] for details. 


8.8 Harmonic Maps into Manifolds of Nonpositive 
Sectional Curvature: Existence 

Let M and N be compact Riemannian manifolds, N of nonpositive sectional 
curvature. In this §, we wish to show that any continuous map g : M — > N 
is homotopic to some - essentially unique - harmonic map. This result will 
be deduced from convexity properties of the energy functional E that follow 
from the assumption that the target manifold N has nonpositive sectional 
curvature. The relevant geometric results have been collected in § 4.8 already. 

As an application in § 8.1, we shall derive Preissmann’s theorem about 
the fundamental group of compact manifolds of negative sectional curvature. 
Further applications will be described in the Perspectives. 
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A continuous map 

g-.M^N 

TTi (M,p) -> 7ri(Ar )S f(p)) 

point in Af). As described in Appendix B, we 

g : M —> N 
to universal covers that is p-equi variant, i.e. 

g{Xx) = p(X)g(x) 

for all x £ M, A £ 7Ti (M. p) where the fundamental groups tti(M,p) and 
7r 1 (A r , g(p)) operate by deck transformations on M and N, resp. 

Y := N is a simply connected complete Riemannian manifold of nonposi- 
tive sectional curvature. In particular, all the results derived in § 4.8 for such 
manifolds apply. We let 

d:Y xY -+] R 

be the distance function induced by the Riemannian metric, as always. 

For p-equivariant maps 

h!,h 2 '■ X := M -> Y, 
we can define an L 2 -distance by 

d(h 1 ,h 2 ) := (/ d 2 (hi(x) , h 2 {x)) dvol(Af)^ , 

where the integration is w.r.t. the volume form of the Riemannian metric on 
M and over some fundamental domain of M in X = M . The p-equivariance 
of h\ and h 2 implies that this integral does not depend on the choice of 
fundamental domain. 

We then put 

Z := L 2 (M,N) :={h : X — * Y p-equivariant 
with d 2 (h, g) < oo}. 

Z = L 2 p (M,N) then is a complete metric space; the completeness is shown 
as for the standard spaces of L 2 -functions (that result is quoted in Theorem 
A. 1.1 in the Appendix), because Y is complete. 

Curves in Z are simply given by families 

(/t)t€[0,l] 


induces a homomorphism 

P = 9t ■ 

of fundamental groups ( p any 
may then find a lift 
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of p-equivariant maps f t : X — » Y, depending continuously on t. We say that 
such a curve is a shortest geodesic if 

for all t £ [0,1]. (It is not difficult to show that this property character- 
izes shortest geodesics in Riemannian manifolds, and so it is natural to use 
this property also in other metric spaces.) It is then easy to describe such 
geodesics: 

Lemma 8.8.1 For every x € M , let 7 X : [0,1] — > N be a shortest geodesic 
with 7 a , (0) = /o(a:),7x(l) = fi(x), chosen equivariantly, i.e. p( A)y x = 
for all x, A. Then the family of maps 

ft{x) :='J x (t), te[o,i] 

defines a shortest geodesic in L 2 (M,N) between fo and f\. 


Proof. 


d 2 (f 0 ,ft) = J d 2 {f 0 {x)J t {x)) dvol(a:) 


= J t 2 d 2 (f 0 (x),f 1 (x)) dvol(x) 

= t 2 d 2 (fo, /i). 


because 7a, defines a shortest 
geodesic from fo(x) to fi{x) 


□ 


Thus, if fo and /i are p-equi variant maps, the geodesic in L 2 (M , N) from 
fo to fi is simply obtained by taking for each x € M the shortest geodesic 
from fo(x) to fi(x) and defining maps ft through this family of geodesics. 

Corollary 8.8.1 Let Ci,C 2 : [0,1] — > L 2 (M,N ) be shortest geodesics. Then 
d 2 (c\(t), C 2 (t)) is a convex function oft. 

Proof. We can use Lemma 8.8.1 to derive this property by integration from 
the corresponding property of N that has been demonstrated in Theorem 
4.8.2. Namely, by Theorem 4.8.2, for each x £ N, if is the shortest 

geodesic from Cj(0)(x) to c*(l)(x), i = 1,2, then 

d 2 { 7i > xW,72,x(*)) 

is a convex function of t. But then also 

d 2 {c\{t) 1 c 2 (t)) = J d 2 (7i,x(i),72,xW) dvol(M) by Lemma 8.8.1 


is a convex function of t. 


□ 
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Similarly, if c: [0, 1] — > L 2 p {M, TV) is a shortest geodesic, and 2 € L 2 (M , TV), 
we have the analogue of (4.8.7) 

d 2 (c(t),z) < td 2 (c(l),z) + (1 — t)d 2 (c(0),z) (8.8.1) 

-f(l-t)d 2 (c(0),c(l)) 

for all t € [0, 1]. 

We now consider the functionals E e and E defined in § 8.4, but this 
time, we define them on the space L 2 (M, TV), carrying out all corresponding 
integrals on a fundamental domain for M in M . 

Corollary 8.8.2 E e and E are convex functionals on L p {M,N), in the sense 
that for any shortest geodesic c : [0, 1] — > L 2 (M , TV), 

E £ (c(t)) and E(c(t)) 


are convex functions oft. 

Proof. As explained in Lemma 8.8.1, such a shortest geodesic is given by a 
family of p-equivariant maps 

ft : M -» TV 

such that for each x £ M, ft(x) is geodesic w.r.t. t. 

Applying Theorem 4.8.2 to the geodesics ft(x) and ft(y), we obtain 

d 2 (ft(x),f t (y)) < td 2 (fi(x), fi(y)) + (l~t)d 2 (f 0 (x)Jo(y)). 

Integrating this inequality w.r.t. x and y as in the definition of E e {f) (cf. 
(8.4.3)) yields the convexity of E e , and the convexity of E follows by passing 
to the limit e — > 0 as explained in § 8.4. □ 


We are now ready to start our minimization scheme for the functionals E e 
and E on the space Z = L 2 (M,N). In fact, we shall demonstrate a general 
result about minimizing convex and lower semicontinuous functionals (recall 
Lemma 8.4.1 and Theorem 8.4.2) on Z\ in fact, the constructions will be valid 
for more general spaces than Z as the only essential property that we shall 
use about Z is the convexity property of Corollary 8.8.1. 

Definition 8.8.1 Let F : Z — > RU {oo} be a function. For A > 0 ,z £ Z, the 
Moreau-Yosida approximation F x of F is defined as 

F\z) := inf (A F(y) + d 2 (y, z)) 
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Lemma 8 . 8.2 Let F : Z — > RU{oo} be convex, lower semicontinuous, ^ oo 
and bounded from below. For every A > 0 , z £ Z, there exists a unique y x £ Z 
with 

F x (z) = XF(y x ) + d 2 (y x ,z) (8.8.2) 

Proof. We take a minimizing sequence (y n )n€ n for F x (z). This means that 
lim (A F(y n ) + d 2 (y n , z)) = F x = inf (A F(y) + d 2 {y , z)) (8.8.3) 

n— >oo yeZ 

For j/ m ,y n € Z, we take a shortest geodesic 7 : [0, 1] — > Z with 7(0) = 
y m , 7(1) = y n and define the midpoint as 

Hm,n = 7(2) 

The convexity of F then implies 
F x < XF(y m , n ) + d 2 (ym tn , z) 

< ^ XF{y m ) + ^A F{y n ) + d 2 (y m , n , z) by convexity of F (8.8.4) 

— 2 T 2^^ (.Un) T 2^ (dmi z) + — d (y n , z) — —d(y m , y n ) 
by (8.8.1) 

Since, by (8.8.3) (A F(y m ) + d 2 {y m , z)) and (A F(y n ) + d 2 (y n , z)) converge to 
F x , we conclude that d 2 (y mi y n ) has to tend to 0 as m, n — > 00. Thus (y n )ne n 
is a Cauchy sequence in Z, and so it tends towards some limit y\. Because 
F is lower semicontinuous, (8.8.2) then follows from (8.8.3). □ 

We can now state our abstract existence result: 

Theorem 8 . 8.1 Let F : Z — > R U {00} be a convex, lower semicontinuous 
function that is bounded from below and not identically +00. Let y\ be as 
constructed in Lemma 8.8.2 for A > 0. 

if(y\ n ) bounded for some sequence X n — > 00, then ( y\)\>o converges 

to a minimizer of F as X — > 00. 

Proof. Take any z £ Z. By definition of y\ n , y\ n minimizes F(y) + j-d 2 (y, z). 
Since y Xn is bounded and X n tends to 00, {y Xn ) n £H therefore constitutes a min- 
imizing sequence for F. We claim that d 2 (y x ,z) is a nondecreasing function 
of A. To see this, let 0 < y < X. 

By definition of y^ 

F(y\) + —d 2 (y x , z) > F(y^j + -d 2 (y^.,z). 

y y 
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This implies 

F(y\) + jd 2 (y x ,z) > F(y^) + jd 2 (y^z)+(^ - -Q (d 2 (y^,z) -d 2 {y x ,z)) . 

This is compatible with the definition of y x only if 

d 2 (y^,z) < d(y x , z) 

showing the claimed monotonicity property of d 2 (y x , z) . 

Since d 2 (y Xl z) is bounded on the sequence (A n j„ e N tending to oo and 
monotonic, it has to be a bounded function of A > 0. It follows from the 
definition of y x that 

F{y x ) = inf{F(y) : d 2 (y,z) < d 2 (y x ,z)}. 

Since d 2 (y x ,z) is nondecreasing, this implies that F(y x ) is a nonincreasing 
function of A, and as noted in the beginning, it tends to inf F{y) for A — > oo. 

y€iZ 

Let now e > 0. By the preceding boundedness and monotonicity results, we 
may find A > 0 such that for A, y > A 

\d 2 (y x ,z)~d 2 (y„z)\< £ -. (8.8.5) 

If A < y < A, we have F(y fJi ) > F(y x ) as F(y x ) is nonincreasing. If y^ x is 
the midpoint of y M and y x as in the proof of Lemma 8.8.2, we obtain from 
the definition of y ^ 

F{yy) + - d 2 (y l _ l ,z ) <F(y x + -d 2 {y x ^,z) 

y y 

<F(y\,v) + j^iy^z) + - - ^ d 2 {y x , yfl )) 
by (8.8.1) and (8.8.5). 

Also, by convexity of F, and since F{y fJt ) > F(y x ), F{y Xifl ) < F{y tl ). There- 
fore, 

d(y x ,yu) < £■ 

Thus, (j/a)a>o is a Cauchy family for A — > oo. 

Since Z is complete, there then exists a unique y <*> = lim y x . Since we 

A — >-oc 

have already seen that 

lim F(y x ) = inf F(y), 

\—>oo y£ Z 

the lower semicontinuity of F implies that 
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In order to apply Theorem 8.8.1 to show the existence of a minimizer of 
E (or, by the same argument, for the functionals E e ), we need to verify that 
in our situation the y\ in the statement of Theorem 8.8.1 remain bounded. 
This is the content of the proof of 

Theorem 8.8.2 Let M and N be compact Riemannian manifolds, N of non- 
positive sectional curvature. Then every continuous map g : M — > TV is ho- 
motopic to a minimizer f of the energy E, in the sense that E achieves its 
minimum in the class L 2 (M,N) of p-equivariant maps between the universal 
covers M and N, where p : 7 Ti(M) — > tti(N) is the homeomorphism of funda- 
mental groups induced by g. (We shall verify subsequently that f is smooth, 
and so in particular continuous.) 

Proof. We first consider the case where g(M) is simply connected. It is not 
difficult to verify that in that case, g is homotopic to a constant map (and 
a constant map obviously minimizes the energy). Since that verification is 
instructive for the general strategy, we proceed to perform it. Let y$ £ g(M)- 
For each y € g(M), we choose a curve j y from y 0 to y. Let c y : [0, 1] — > N 
be the geodesic from yo to y homotopic to j y . It is unique because N has 
nonpositive sectional curvature (Theorem 4.8.1), and it does not depend on 
the choice of 7 ^, because any two curves in g(M) from z/o to y are homotopic 
to each other as g(M) is simply connected. We put 

9t(x) = c g{x) (t). 

gt{x) is continuous w.r.t. t, and also w.r.t. x, because 

d 2 (c yi (t),Cy 2 (t)) < td 2 (c yi (l),c y2 (l)) by Corollary 4.8.3. 

Since g 0 = y 0 , gi = g,g t provides a homotopy between a constant map and 
g , as desired. 

If g(M) is not simply connected, we choose some closed curve 7 in g(M) 
that is not homotopically trivial. Let c be a closed geodesic in N that is 
homotopic to 7 (Theorem 1.4.6). Let g £ L 2 (M,N) be the lift of g to uni- 
versal covers. In order to apply Theorem 8.8.1, we have to exclude that the 
L 2 (M , iV)-maps y\ constructed for z = g in Lemma 8.8.2 become unbounded, 
i.e. that the L^-distance between g and y\ becomes unbounded for A — > 00 . 
y\ projects to a map g\ : M —> N homotopic to g. Let 7 a be a closed curve 
in g\{M) that is homotopic to 7 . Let x £ M with g(x) £ 7 , and yo £ c. Let 
Ca : [0, 1] — > N be the geodesic from y 0 to g\{x) in the homotopy class deter- 
mined by a homotopy between g and g\. Let b\ be the geodesic loop (which 
exists by Theorem 1.4.6) from g\(x) to itself that is homotopic to cxccf 1 . 
Thus, b^cxccf 1 is homotopic to a constant curve. Likewise, let 6 a, t be the 
geodesic loop based at c\(t) homotopic to ca| [o,z] c ( c a| [ o,z ] ) — 1 - By lifting to the 
universal cover N, we see that the energy E(bx,t) becomes the squared dis- 
tance between two different lifts of Ca, i.e. two geodesics, and so it is convex 
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by Theorem 4.8.2. Since c = b \ is a shortest geodesic, E(b\j) is minimal 
at t = 0. Thus, assuming d 2 (g,g\) — * oo for A — » oo,E(b\ tt ) either tends to 
a constant function, or E(b\ t i) goes to oo. In the latter case, however, the 
lengths of all curves in g\(M) homotopic in N to c would also go to oo, and 
that would let the energy of g\ tend to oo as well, in contradiction to g\ 
being a minimizing family for A — > oo by the proof of Theorem 8.8.1. 

If the lengths are constant, i.e. bx , i is asymptotically of the same length as 
b\'Q = c, we either find another homotopy class of curves for which the length 
goes to oo - which is impossible as already argued - or the length remains 
constant for all homotopy classes. In that case, however, the construction of 
the Moreau- Yosida approximation implies that d 2 (g,gx) cannot tend to oo, 
because E is not changed, while d 2 (g,g\) is decreased if we move the image 
of M closer to c along the curves c\ (’’closer” here refers to the lifts to the 
universal cover N), i.e. replacing x by c\(t) for t < 1. 

Thus, in any case, d 2 (g,g\) stays bounded, and Theorem 8.8.1 yields the 
result after all. □ 


Perspectives. See the Perspectives on § 8.10 


8.9 Harmonic Maps into Manifolds of Nonpositive 
Sectional Curvature: Regularity 

In the preceding §, we have shown the existence of a minimizer of the energy 
functional E in a given homotopy class, or more precisely, in the class of L 2 - 
maps that induce the same action by deck transformations on the universal 
covers as some given continuous map g. It is the purpose of this § to show the 
regularity, i.e. the smoothness of such a minimizer. In fact, we shall present 
different regularity proofs with the purpose of showing a more representative 
sample of techniques from geometric analysis. 

It is clear that a minimizer / of E is a critical point of E in the sense of 
Definition 8.7.1. Namely, in § 8.1, we have computed that for a compactly 
supported vector field ip along / and ft(x) = exp^^-j tip(x), 

jE{f t ) |t=o = J(df,diP). 

Thus, in particular, E(f t ) is a differentiable function of t, and since f = fo 
minimizes E, this derivative at t = 0 has to vanish, for all such ip. 

If k is some smooth function on the image of /, and if tp is a smooth 
function on M with compact support, we may consider the test vector 


{dk) o f(x) ■ ip(x). 
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We obtain (referring to § 8.1 for the notation) 

0 = J (df,dtp) = j ( df,d(dk)ip(x )) , with dk being evaluated at f(x ) 

= f <p(x)(df 1 X7 a ( dk)®dx a ) 

J v** 

+ J(d<p(x),d(ko f)(x)) (8.9.1) 

= J <p(x)Vdk(df,df)(: r) 

+ J (dip(x), d(k o f)(x)), recalling (8.1.11) and (3.3.48). 

We now take 

K z ) = \d 2 (z,p), 

lifting to universal covers as always. 

By Lemma 4.8.2, 

Vdk(df,df) > \\df\\ 2 . 

Inserting this into (8.9.1) yields 

J (dcp(x),d(ko f)(x)) < - J ip{x)\\df(x)\\ 2 . (8.9.2) 

(8.9.2) means that k o f is a weak subsolution of 

—A(k o /) > ||d/|| 2 - (8.9.3) 

(Cf. Corollary 8.7.5 for the corresponding result in the case where f is a 
smooth harmonic map.) 

We shall now use this differential inequality to derive the Holder continuity 
of our minimizer /. Theorem 8.6.1 will then imply that / is smooth. 

The same argument actually shows that for any smooth convex function 
k on the image of /, we have 

-A(kof)>0. (8.9.4) 

In the sequel, however, the functions k(z) = \ d 2 (z,p ), for various choices of 
p, will entirely suffice. 

We shall need a version of the Poincare inequality 

Lemma 8.9.1 Let M be a compact Riemannian manifold, Y the universal 
covering of a Riemannian manifold of nonpositive curvature. Then there exist 
ro > 0 and a constant Co < oo such that for any ball B(xo,r) C M, 0 < r < 
ro, and any L 2 -map with finite energy, 
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/ : B(x 0 ,r) -> Y, 


the following inequality holds 

J d 2 (f{x)J B )<c 0 r 2 j ||<i/(V)|| 2 , 

B(x 0 ,r) B(x 0 ,r) 

where f B G Y is the center of mass of f, i.e. f B minimizes 


d 2 (f(x),p) dvol(a’) w.r.t. p GY. 


B(x o,r) 


(8.9.5) 


Proof. The factor r 2 on the rhs of (8.9.5) comes from a simple scaling argu- 
ment; such a scaling argument is possible because for sufficiently small r > 0, 
the geometry of the ball deviates to an arbitrary little degree from the one 
of a Euclidean ball of the same radius. Thus, we neglect the factor r 2 in the 
sequel. 

If the inequality (8.9.5) then is not valid, we can find a sequence (/ n )neN 
of maps from some such ball B(xo,r) into Y for which 

J d 2 {f n {x)i f niB ) > n J ||d/„(a;)|| 2 (8.9.6) 

Since Y has a compact quotient, we may compose f n with deck transforma- 
tions, i.e. isometries of Y, which leave both sides of (8.9.6) invariant, such 
that / n> _e always stays in some compact region of Y . Thus, we may assume 
that the f n ^ B converge to some p € Y. If the lhs of (8.9.6) happens to be 
smaller than one, we may rescale Y, i.e. we consider the chart 

exp p : T p Y — > Y 

and replace the Riemannian metric gij{z) of Y in this chart by the metric 
gtj(pz) for a suitable p > 1. This multiplies the distance function d and 
the norm || • || by a factor p which we can thus adjust to make the lhs of 
(8.9.6) equal to 1. The curvature of Y gets multiplied by X, and as p — > oo, 
the rescaled Riemannian manifold (Y, gij{pz)) becomes Euclidean, and the 
Poincare inequality reduces to the Euclidean one. 

We now turn to the case where the lhs of (8.9.6) is bigger than 1. 

For any map g : B(xo,r) — > Y, we may perform the following construc- 
tion: 

9t (x) := exp f (exp g(x)), for 0 < t < 1. 

Thus, for any x, 

d(g t (x),g B ) = td(g(x),g B ), 
and since g B is characterized by the property that 


(8.9.7) 
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J expg^ffO)) dvol(a;) = 0, 

we see that 

9 b = gt,B , 

i.e. gs remains the center of mass for the maps gt- 
Since Y has nonpositive curvature 

d(gt{x),g t {y)) <td(g(x),g(y)) for all x,y,0 < t < 1, (8.9.8) 

by (4.8.8). Therefore also 

\\dgt(x)\\ 2 < t 2 \\dg{x)\\ 2 , (8.9.9) 

whenever this expression is well defined. 

For each n £ N for which the lhs of (8.9.6) should happen to be bigger 
than one, we choose t = t n ,0 < t < 1, such that 

= 1 - 

Because of (8.9.7) and (8.9.9), we may then replace f n by f n .t. without making 
(8.9.6) invalid, and so, we may assume wlog 

J d 2 (f n (x), fn }B ) = 1 for all n G N. (8.9.10) 

Then 

J ||d/„(a;)|| 2 -> 0 for n -> oo, 

and therefore f n has to converge to a constant map fo=P for some p £ Y. 
By Rellich’s theorem (see Theorem A. 1.8; the standard proof for functions, 
see, e.g. J. Jost, Postmodern Analysis, Springer, 1998, p. 265 ff., carries over 
to maps with values in Y, because we have constructed in § 4.8 the modifiers 
on which that proof depends) 

J d 2 {f n {x),f n ,B ) 

converges to 

J d 2 {fo{x)Jo,B ) = J d 2 (p, p) = 0. 

This, however, contradicts (8.9.10). This concludes the proof. □ 


Let us also present an alternative proof of the Poincare inequality that 
does not use Rellich’s theorem, but rather employs the constructions of § 4.8 
directly: 
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By (4.8.21), 

d{f(x),f B )< j d(f(x),f(y))dy. 

B(xo,r) 

We may work with the Euclidean volume form on dy on B ( xq , r) induced by 
the exponential map exp a , o : T Xo M — > M, rather than with the Riemannian 
one. Since the two are uniformly equivalent, this will only affect the constant 
Co in the estimate. In other words, we assume that B(xo,r) is a Euclidean 
ball 

{y e R m : d(x 0 ,y) = \x 0 - y\ < r}. 

We may also assume that / is differentiable, because a general / may be 
approximated by the differentiable mollified maps fh as explained in § 4.8. 

Then 


\ x ~y\ 


d(f(x),f(y)) < 


l f(x+r y~ x ) 

dr \y ~ x\ 


dr 


(the meaning of should be obvious) 


and so 


d(f(x),f(y))dy < 


mum J \x ■ 


-y^iWdfMWdy’ 


for m = dim M. oj rn = volume of the m-dimensional unit sphere. 
Therefore 


d 2 (f(x),f B )dx < 


< 


d(f(x),f(y))dy) dx 

1 


< 


x _^zr\\df(y)\\dy) dx 
l 

X — ' 


-^ZT\\df(y)\\ dy 

FIF*) dx 

by Holder’s inequality (Theorem A. 1.2) 

2 


1 


W(y) || 


1 


\x-y\' 


—dx) dy 


Since 


by Fubini’s theorem. 

_ ^| m _i dx < mu} m r for all y £ B(x 0 ,r), 


B(xo,r) 
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we obtain 

J d 2 (f(x),f B )dx<r 2 j \\df (x)\\ 2 dx 

B(x 0 ,r) B(x 0 ,r) 

and the constant Co arises from estimating the Euclidean volume dx against 
the Riemannian volume dvol(;r). In fact, employing Riemannian normal co- 
ordinates at xo, we see that this yields a factor of magnitude (1 + Cir 2 ). □ 

In the sequel, we shall assume that the radii R of all balls B(x o, R),x o £ 

M, are smaller than the injectivity radius of M. We then do not need to 
distinguish between such a ball and its lift to the universal cover M. Also, 
on such a ball, the negative Laplace-Beltrami operator in local coordinates, 

“ in § 8 

is of the type considered in A. 2, and therefore on such a ball, the Harnack 
inequalities stated in Theorem A. 2. 2 hold. 

By the Harnack inequality (Theorem A.2.2(i)), we have for Xq £ M,p £ 

N, m = dim M, 


sup d 2 {f{x),p) < c 2 
B(xo,r) 

because of the inequality 

- Ad 2 {f{x),p ) > 0 (8.9.12) 

that follows from (8.9.3). 

In order to control the rhs of (8.9.11), we observe that we can control 

d(p, /b) 

where f B is the center of mass of f on B(xo,2r), because / is in L 2 . We 
therefore need to estimate 

J d 2q {f(x),f B ) dvol(x). 

B(x 0 ,2r) 


^ J d 2q (f (x) , p) dvol(a;) J for q > 1, 

V B(x 0 ,2r) 


(8.9.11) 
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As in the second proof of the Poincare inequality, we have (replacing again 
clvol(;r) by the Euclidean volume element dx) 


d 2q (f(x),f B )dx < 


d(f(x),f(y))dy) dx 


2 q 


< 


< 


1 


m 2q U>m 

1 

ni 2q UJm 


1 


2 9 


\ X -y\rr>-l \W(y)\\dy) dx 

1 \\df(y)\\ 2 dy S j 

\ q 


x 


-y\ 




1 


2 q dy 


I I (m— 1) 

\x — y\ 1+9 
„ \ q ~ 1 

\\df(y)\\ 2 dyj dx 

(by Holder’s inequality (Theorem A. 1.2) with exponents pi = 2, 
p 2 = 2 q, p 3 = and writing 


' — 2/1 


nlM/ll 


m— 1 


1 — y\ 


m — 1 


l 


l 


l 


\x — y\ m ~ l 

\ 9+1 


WdfW 1 *] {\\dfW 1 - 1 *} 


m 2q LUm 


. \x — y| (m_1)l +g / 

by Fubini’s theorem as in the second proof the Poincare inequality. 
Now 


dy 


\\df(y)\\ 2 dy) ) 


1 


2 q 


, ,x^rdy< oo if - — < 

I ( m— !) TT^ 


m 


\ x — y|i TO_1 il+g " 1 + 9 TO-1 

and if we choose q > 1 satisfying that condition, we can bound d 2 (f(x),p) 
by d 2 {pJ B ) and 

J \\df(y)\\ 2 dy. 

B(x 0 ,2r) 

(The first proof of the Poincare inequality given above can also be strength- 
ened to yield the present stronger conclusion, by making use of Kondrachov’s 
extension of Rellich’s theorem, see Theorem A. 1.8.) 

In particular, d 2 (f(x),p) is bounded on B(x o, r), since / has finite energy. 
We record this as 


Lemma 8.9.2 Let f : B(xo,4r) — > Y (complete, simply connected, nonposi- 
tive sectional curvature) be a map of finite energy, satisfying 
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—Ad 2 (f(x),p) > 0 weakly for all p € Y. 
Then f is bounded on B(xo,r). 


□ 


Lemma 8.9.3 Let f : B{xq 1 At) — * Y satisfy —Ad 2 (f(x),p) > 0 weakly for 
every p £ Y where i?(xo,4r) is a ball in some Riemannian manifold M, 
0 < 2 r < i(M) and Y is a manifold of nonpositive sectional curvature, the 
universal cover of a compact manifold N. Let 0 < «q < k < Kq, and suppose 
that 

diam f(B(xo,2r)) := sup d(f(x 1 ),f(x 2 )) = K,. 

x 1 ,x 2 eB(x 0 ,2r ) 

There exists e > 0 depending on the geometry of M and N and on kq and 
Ki with the property that if 0 < e < £o and 

f(B(x 0 ,2r)) 

is covered by k balls B \, . . . , Bk of radius e, then 

f(B(x 0 ,r)) 

can be covered already by k — 1 of those balls. 

□ 


Proof. Since we may obviously assume that each ball Bi contains some point 
f{xf) we have 


Bi c B(pi,2s), with pi = f(xi),i = 1, . . . ,k. (8.9.13) 

If we assume s < £o < the balls 

K . 

B{pi,-), i = l,...,K, 

cover f(B(xo,2r)). Since its diameter is k, f(B(xo,2r)) is contained in some 
ball of radius at most 2k. Because the geometry of Y is uniformly controlled 
as Y admits a compact quotient 8 , there is some integer k\ such that any 
such ball of radius <2 k < 2kq contains at most k\ points whose mutual 
distance is always at least |. Therefore, already k\ of the balls B(pi, |) cover 
f{B{x o, 2 r)), say for * = 1, ... , Aq. 

8 actually, what is needed at this point is solely a lower bound on the Ricci curvature 
of Y, combined with the assumption that Y has nonpositive sectional curvature, 
but we do not pursue this issue here 
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Therefore, for at least one of those Pi, say for pi, 
meas (f^ 1 (B(p 1 , J))fl%,r)) 

> meas (B(xo,r)) 
ki 

> —r m (8.9.14) 

fci 

for some constant p > 0 depending on the geometry of M. 9 
We consider the auxiliary function 

g(x) := ^d 2 (j?i,/(a:)). 

We put 

M : = sup g(x) < -k (diam (f(B(x 0 ,r)))) < 1. (8.9.15) 

x£B(xo,2r) ^ 

By the triangle inequality, and since diam (f(B(xo,2r))) = k, there also has 
to exist some y £ B(xo,2r) with 


hence 


On / 1 (B(p 1 , f)), we have 


d(f(y),Pi) > 


1 

4' 


sM < T. 


We consider the auxiliary function 

h(x) := p — g(x) > 0 on B(x o, 2 r), (8.9.16) 

and 

h(x) > ^ on r\B( Pll ^)). (8.9.17) 

By (8.9.12) and the definition of g and h, we also have 
—Ah(x) < 0 weakly in B{x$,2r). 

Because of (8.9.17), we may apply the Harnack inequality Theorem A. 2. 2 (ii) 
to obtain 


9 g is controlled from below by an upper bound for the sectional curvature of M, 
but again this is not pursued here 
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inf h(x) > So — [ h(x)dx for some So > 0 

?Oo ,r) r m J 

B(xo,r) 

> S for some S > 0, 

by (8.9.17) and (8.9.14). 


(8.9.18) 


This inequality now implies that for sufficiently small e, we cannot have 

f(B(xo,r))r\B(pi,2£)^® for all* = 1, . . . , k. (8.9.19) 

Namely, the balls B(pi, 2e) cover f(B(xo,2r)), and thus, if the supremum is 
realized in (8.9.15) for y € B(xo,2r), i.e. 

\d 2 (pij(y)) = p, 


we can find some pi with 


d(pij(y)) < 2e. 


So, if (8.9.19) held, we would have d{f{x\),f{y)) < 4e for some x\ G B{x o, r), 
and thus 

inf h(x) < h(xi) = p ^d 2 {p ll f(xi)) 


B(xo,r) 


< 16 


£y/p 


which contradicts (8.9.18) for 


e < 


Sn\ 

"16"' 


Thus, for such an e, f{B{x o, r)) is disjoint to one of the balls B(pi, 2e), hence 
also to one of the balls _B;, because of (8.9.13). Thus, it can be covered by 
the remaining ones. □ 

Equipped with the preceding Lemma, we may now prove 


Theorem 8.9.1 Let B(x i, 12r) be a ball in some Riemannian manifold, 0 < 
12?’ < i(M), Y the universal cover of a compact Riemannian manifold of 
nonpositive sectional curvature (and thus complete, simply connected, and 
nonpositively curved itself), and let 

f:B( Xl ,12r)^Y 


satisfy 


and 


E(f) < oo 

—Ad 2 (f(x),p) > 0 weakly 


for every p £ Y. 
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Then f is continuous on B(xi,r). 

Here, with the notation of § 8.1 for the metric on the domain M, A is the 
Laplace-Beltrami operator 

1 d 0 d 

Proof. By Lemma 8.9.2, / is bounded on B(xi,3r), hence on B(xo,2r) for 
every Xq £ B(xi,r). 

Thus, 

diam f(B(x 0 ,2r)) < n 0 (8.9.20) 

for some kq < oo. Let now 0 < /sq < kq. We want to find some p > 0 with 

diam f(B(xo,p)) < K\. (8.9.21) 

Let £o = £o(«Oi k l) be as in Lemma 8.9.3. 

Because of (8.9.20), we can bound the number k 0 of balls Bi, . . . , Bk 0 
of radius £ o in Y that are needed to cover f(B(xo,2r)). By Lemma 8.9.3, 
f(B(xo,r)) can be covered by at most fco — 1 of them. If 

diam f(B(x 0 ,r)) > Ki, 

we may apply Lemma 8.9.3 again with | in place of r and k = ko — 1 and 
cover f(B(x o, |)) by at most ko — 2 balls. We can repeat this construction 
until, for some i/gN, 

f(B(xo,2~ v r )) 

is covered by so few balls of radius £o that we must have 
diam f(B(xo,2~ v r)) < K\. 

Since this holds for every xq € B(x\,r) and every K\ > 0, we see that / is 
continuous on B(x\,r). □ 

We shall now present an alternative (and more general) derivation of 
Theorem 8.9.1 not based on Lemma 8.9.3. Of course, the Harnack inequality 
will again be used in a crucial manner. The geometry of the domain M will 
only enter through the Poincare inequality (Lemma 8.9.1) (which implies the 
Harnack inequality) and the following ball doubling property for the volume 
form: 

vol(B(x,2r)) < covol(B(x,r)) (8.9.22) 

for some constant cq, all x £ M, and all sufficiently small radii r > 0. 
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We shall make use of the following abbreviations: 

For v € L°°(B(xq 1 R)): 

v+, R := supi> , := infu , v R := vp(dx) 

B(x 0 ,R) B(xo,R) J 

B(xo,R) 

(as always, sup and inf are the essential supremum and infimum). 

Lemma 8.9.4 Let v be a bounded weak subsolution {—Av > 0) on B(x o, 4 R) . 
There exists a constant 5 q, independent of v and R, with 

v+,r < (1 - <5 0 )v+,4it + S 0 v R . 


Proof. 


V +AR ~V R = j- (u+,4 R - v) 


B(x 0 ,R ) 


< 


( j- \v+, 4 , R - v\ p ) ' since p > 1 


B(xo,R) 


< C 2 


( j- \v+,4R - «| p ) 


B(x 0 ,2R ) 


by (8.9.22) 

< c 3 (v+, 4 R - v +:R ) by Theorem A. 2.2(h) , 

since v + ^ R — v is a nonnegative supersolution on B(xq,4R). 


Consequently 


V +,R < 


C3-1 1 

v + 4R H v R . 

C3 C 3 


□ 


From Lemma 8.9.4, we derive 

Lemma 8.9.5 Let v satisfy the assumptions of Lemma 8.9.4, an d suppose 
0 < £ < \. There exists m £ N (independent of v and e) s.t. 

v+, £ ™r < e 2 n+,i? + (1 - £ 2 )v r , 

for some R' with e m R < R' < ^ (R' may depend on v and e). 
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Proof. Iterating the estimate of Lemma 8.9.4, we get for i/£N 
v+,4 —r < (1 - 5a) v v +tR + (1 - (1 - S Q ) V ) S TiVi-iR 

i— 1 

•ii _ 

Wltll Tr — i_( 1 _5 0 )i. ■ 


We choose v so large that 


(1 - <50)" < ^ 2 , 


and choose R = 4 with j G {1, . . . , u} such that v±-j R is largest. 
Noting that 

4~ v > e m 


if 


TO > 


-(log 4) 
log(l - S 0 ) ’ 


the result follows. 

Lemma 8.9.6 Under the assumptions of Lemma 8.9.f, 

lim v R = lim v + r . 

R—>0 R^O + ’ ' 


□ 


Proof. This follows directly from Lemma 8.9.4. 

Lemma 8.9.7 Let f : B{xo,AR) — » Y satisfy (8.9.3). Let p € Y. 
Then, with 

v(x) := d 2 (f (x) , p) , \B(x 0 ,R)\ ■■= vol{B(x 0 ,R)) 

R 2 


\B(x 0 ,R)\ 


J \\df(x)\\ 2 dvol(x) < c 5 {v +A r - v +tR ) . 


In particular 


B(x 0 ,R) 


lim i R 2 \ 

R-> o \B(x 0 , f?)| 


\\df{x)\\ 2 dvol{x) = 0 . 


B(xo,R) 


□ 


(8.9.23) 


(8.9.24) 


Proof. 

v(x) = d 2 (f(x),p) satisfies (8.9.3), that is, 

—Av > 2||d/|| 2 . 

Let G R (x,y) be the mollified Green function on B(xo,R) relative to 
B(x o, 2 R), i.e. G R (x 0 , •) G H 1 ’ 2 n C$(B(x 2 R)) 
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We put 


J (d<p(x), dG R (xo, x))dvol(x) = j- p(x)dvol(x) 

B(xo,2R ) B(xo,R ) 

for all p £ H 1 ' 2 with supp p <s B(x o, 2 R). 


»*(x) ■.= |B(l °; 2a)l G g (»o,»). 


R 2 


We then have 

J (dp(x),dw R (x)} = J p(x) (8.9.25) 

B{xq,2R ) B(xo,R) 

for all p € H 1,2 with supp p<sB(xo,2R). 


Furthermore, from the estimates for G R of Corollary A. 2.1, we have 

0 < w R < 71 in B(x o, 2 R) 
w R > 72 > 0 in B(xq, R) 

for constants 71,72 that do not depend on R. 

We then have with z := v — v+ar 


(8.9.26) 

(8.9.27) 


A J (df,df)(w R ) 2 < J (w r ) 2 (-A)z 

B(xq,2R) B(xo,2R ) 

= - J ( d(w R ) 2 ,dz ) since w R € H 1,2 (B(xo,2R)) 

B{xq,2R ) 

= -2 J (dw R ,d(w R z)) + 2 J z(dw R , dw R ) 

< —2 J (dw R , d{w R z)) since z < 0 . 

From (8.9.25), (8.9.26), (8.9.27), we get 

J ( df , d/) < H J (v+,4R - v) 


B(x 0 ,R) 


B(x 0 ,R) 


\B(x 0 ,R)\, A 

< C4 ^2 ( W +, 4 K - Vfl) 

< c 5 ^^2 ^ (^+, 4 fl _ % Lemma 8.9.4 


□ 


We are now ready to prove the Holder continuity of /. For a point xo in 
the domain and a radius R > 0, let 
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fn := mean value of / on B(xq, R), 

(that is, as in Lemma 8.9.1, the minimizer of f B ( Xo d 2 (f(x),p)dvol(x) w.r.t. 
p), and 


v p (x) := d(f(x),p ), with p GY chosen subsequently. 

We apply Lemma 8.9.5 to 

v Ir = 

4 

and choose £ = \- £ m < \ and £ m R < R' < ^ , where m £ N does not 
depend on e or R < R 0 . We therefore obtain 


vr> 


j- d 2 (f{x), fz)dvol(x) 

B{x 0 ,R’) 


<C 0 


j- d 2 (f(x),fs)dvol(x) 

B(x 0 ,f ) 


for some Cq independent of i?, 


using the ball doubling property (8.9.22) 


< CiR 2 
\B(x 0 ,R)\ 


\\df\\ 2 dvol(x) 


B(x o,f) 

by the Poincare inequality Lemma 8.9.1 
< + e) by Lemma 8.9.7 


also using the ball doubling property (8.9.22) once more. 


Combining this estimate with Lemma 8.9.5, we get for p in the convex hull 
of f(B(x 0 ,£ m R)) 

sup d 2 (f,p) < 4 sup ci 2 (/,/fl) 

B(xo,£ rn R) B{xo,e rn R) 4 

< 4e 2 sup d 2 (f, Jr) 

B(xo,R) 

+ C 3 (v Pt+! R - V p + r) 

< 16e 2 sup d 2 {f 1 p) 

B(x 0 ,R) 

+ C 3 (Vp t + ,R — 1>p ) + i£ m fl ), 

since £ m < \ 


uj(p) := sup d 2 (f{x),p) = u p , +jP 
B(x 0 ,p) 


We put, for 0 < p, 
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and obtain 

{l + Cz)u{e m R) <(^ + C 3 MR). 

64 

Here, £ m is considered as a constant. By iteration, we obtain 

v(p) < c (j^) “(fto) 

for some c > 0 and some 0 < a < 1. 

This holds for any p in the convex hull of f(B(xo,p)). In particular, we 
may choose 

P= f P - 

Since 

w(p)2 < osc / < 2 u(p)i, 

B{x 0 ,p) 

this implies the Holder continuity of /. Thus, we have obtained another proof 
of Theorem 8.9.1. 

Corollary 8.9.1 Let f : M — > N be a weakly harmonic map between compact 
Riemannian manifolds M and N, with N of nonpositive sectional curvature. 
Then f is smooth. 

Proof. Let B(xi,6r) be a ball in M with 0 < 6r < i(M). Since such a ball 
is simply connected (being the diffeomorphic image of a ball in T Xl M under 
the exponential map exp^.J, we may lift / to a map 

/ : B(x u 6r) -> Y 

into the universal cover Y of N. Therefore, we may apply Theorem 8.9.1 to 
get the continuity of /. The smoothness then follows from Theorem 8.6.1. □ 

In the preceding, we have seen how to use the weak version of the differ- 
ential inequality 

~Ad 2 (f(x),p) > 2||cZ/(a;)|| 2 (see (8.9.3)) 

to derive the continuity of a weakly harmonic map / with values in a manifold 
of nonpositive sectional curvature. 

There is another differential inequality for such a harmonic map that can 
be used to obtain estimates, namely 

-A\\df(x)\\ 2 > ~a\\df(x)\\ 2 , (8.9.28) 

where —a is a lower bound for the Ricci curvature of M. This inequality 
follows from (8.7.13). 
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We shall now display an alternative approach to the regularity result of 
Corollary 8.9.1 that is based on some weak analogue of (8.9.28). Our con- 
struction will exploit the center of mass properties of the approximating func- 
tionals E e (cf. Lemma 8.4.1) and constructions from § 4.8. 

Let f = f e be a minimizer of E e . (Of course, the existence of a minimizer 
for E e follows by the same method as the one for E, see the proofs of Thms. 
8.8.1 and 8.8.2.) By Lemma 8.4.1, for almost every x £ M , f(x) is the center 
of mass of / on the ball B (x, e). (As before, we lift / to a map / : B(x, e) — > Y 
into the universal cover of Y where the center of mass then exists by Theorem 
4.8.4) 

Let now Xi,x 2 £ M with d(x i,x 2 ) < i(M). We define a cliffeomorphism 
f ■ B(x i,e) -> B(x 2 ,e) 

as follows: Let 

ip : T Xl M — > T X2 M 

be the linear map that maps an orthonormal frame at x\ into that orthonor- 
mal frame at x 2 that is obtained by parallel transport along the shortest 
geodesic from X\ to x 2 . %p then is a Euclidean isometry. 

We put 

V '■= exp a , 2 otp o exp" 1 , 

and ip is almost an isometry in the following sense: 

If dv\ and dv 2 are the volume forms on B{ x\,e) and B{ x 2 ,e), resp., then 

\dv 2 — ipifdvi | < ce 2 • Euclidean volume form, 

for some constant c. This is easily seen by writing the volume forms in normal 
coordinates and using Theorem 1.4.4. 

Also, if Vi is the volume of B(xi,e), then 

\Vi-LO m £ m \ <C£ 2 . 10 

We then apply Corollary 4.8.7 to get 

d(f(xi),f{x 2 ))<^r j d(f(y),f(ip(y)) dvol(y) 

B(x i ,s) 

+ / d(f(y)JM) 1 ^- 

B(x 2 ,e) 


(8.9.29) 

yj* dvol(y) 

Vi '■ 


10 c can be controlled by a lower bound on the Ricci curvature of M and an upper 
bound for its sectional curvature, but we do not verify this here 
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We note that 

d(y,ip(y)) < d(x\,x 2 ) cosh (V— Ae), for all y € B(x \,e) 

if A < 0 is a lower curvature bound for M; this follows e.g. from Theorem 
4.5.2. Again, at this point, it is only needed that 

d(y, <p(y)) < d(x i,x 2 )(l + ce 2 ) 


for some constant c. 

We now iterate (8.9.29), i.e. we estimate the quantities d(f(y),f(ip(y)) 
and d{f{y),f{x 2 )) in the integrals on the rhs by applying (8.9.29) again. 
Repeating this a finite number of times, depending on e, and using the fact 
that all errors, i.e. deviations from the Euclidean situation, are quadratic in 
e, we obtain for d(x i,x 2 ) < £ 

I ®Sj„i ( ,) dTO i W 

M B(x,e ) 

for some constant c depending on the geometry of M, 

< c'E e (f)z by Holder’s inequality, 
for some other constant c! . 

This was for d(x \,x 2 ) < £. If d(x i,x 2 ) < v£ for some v G N, we use the 
triangle inequality to obtain 

d(f(x!),f(x 2 )) < d E e {f)v£. (8.9.30) 

This was for a minimizer / = f £ of E e . As for example in the proof of the 
Arzela-Ascoli theorem (see e.g. J. Jost, Postmodern Analysis, Springer, 3rd 
ed., 2005, p. 55-56), one uses (8.9.30) to find a sequence £ n — > 0 for which 
the maps f £n converge uniformly and hence also in L 2 towards some /. By 
Lemma 8.4.5, / minimizes E, and it satisfies the limit of the above estimates, 
i.e. 

d(f(xi),f(x 2 )) < c’E(f)d(x i,x 2 ) 

for all X\,x 2 € M. By the uniqueness theorem proved below (see Theo- 
rem 8.10.2), this estimate then holds for any minimizer of E. We have thus 
shown 

Theorem 8.9.2 Let M and N be compact Riemannian manifolds N of non- 
positive sectional curvature, and let f : M — > N minimize the energy in its 
homotopy class. Then f is Lipschitz continuous. 

□ 
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Corollary 8.9.2 Under the assumption of Theorem 8.9.2, any minimizer f 
of the energy is smooth. 


Proof. By Lemma 8.7.2, / is a weak solution of 


_1 d_ 

py dx a 



dp\ 

dxh ) 


-rj k {f) 


df j df k 
~dxP~dxP' 


(8.9.31) 


By Theorem 8.9.2, the rhs of (8.9.31) is bounded. By Theorem A. 2. 3 of the 
Appendix, therefore f £ C 1,a , for some 0 < a < 1. But then the rhs of 
(8.9.31) is of class C a . Applying Theorem A. 2. 3 once more, yields f £ C 2,a . 
Iterating this argument shows that / is smooth. □ 


Before concluding this §, we want to show how to use (8.9.28) directly to 
get a-priori estimates for harmonic maps. Since we have not been very precise 
about the geometric quantities on which the previous estimates derived in our 
regularity proof depend, we can also use those a-priori estimates to remedy 
that point. These estimates will use the assumption that / is a smooth har- 
monic map, and so, they cannot be used to show regularity. Such estimates, 
however, can be employed in various existence schemes (as for example in 
previous editions of this book). 


Theorem 8.9.3 Let f : M — > N be a harmonic mapping between Rieman- 
nian manifolds, where N is complete, simply connected, and of nonpositive 
sectional curvature. If x £ M,p > 0, and B(x,p) C M, then 

e (f)P) --l\\df(x)\\ 2 <c 0 (l+ max d 2 (f(x),f(y)), (8.9.32) 

2 p z yeB(x,p) 

where cq depends only on m = dim M, on Ap 2 , where A is a bound for the 
absolute value of the sectional curvature of M, and on a lower bound for the 
Ricci curvature of M. 


Proof. We put 

r{y) := d(x,y), 

Q ■■= f(x) 

and assume for simplicity m = dim M > 3. (The proof for m = 2 is 
similar.) We use Lemma 4.7.2 with m instead of n, x instead of p, and 
h(y ) = d 2 (f(y),q). We obtain 
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B(x,p ) 

z ^ ji ltl ^ A , "i -2 f d 2 (f(y), q) 

<-(m-2)u m d (/(*),«) + — 2 — A J r ( y -) m -2 

B(x,p ) 

+ J <P(f(y),o) 

dB(x,p) 

<Ci max d 2 (f (y) , q) , 
y&B{x,p) 


(8.9.33) 


with ci depending on m and yip 2 . 

We next let rj € Cq°(B(x, (()) be a cut-off function, 


0 < i] < 1 on B(x, 

?7(x) = 1 

|Vp| < — , \Ar)\ < 

P P 

We then apply Lemma 4.7.2 to h(y) = q 2 {y)e{f){y) and obtain 


(to 


2 Ml)(x)< J 

B(x,p) 



B(x,p) 


(d 2 e(f))(y) 

r(y) m ~ 2 


(8.9.34) 


Now 

A(n 2 e(/)) < \Ari 2 \e(f) + 4r?|Vp||| Vd/|| • ||d/|| + r?Me(/) 

< % e(/) + ?? 2 || Vd/|| 2 + e(/) 

/CT /T 

-7? 2 ||Vd/|| 2 + c 5 e(/) 

by (8.7.13), since N has nonpositive curvature, where — C5 is a lower bound 
for the Ricci curvature of M 

< c 6 (l + 4r)e(/). (8.9.35) 

P 

From (8.9.34), (8.9.35) 

e(/)(z)<C7(l+^) J ( r( ^ 1 m _ 2 ~ ^=a)e(/)(t/) (8-9.36) 

B(x,p) 
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noting r J_ 2 < const ( ^= 2 ) if r < where the constant depends 

only on m , as well as 77 ( 2 /) = 0 if r(y) > ^ . 

We now recall (8.9.3), i.e. 

-Ad 2 {f{y),p)>Ae{f){y). (8.9.37) 

Then (8.9.32) follows from (8.9.36), (8.9.37), (8.9.33). □ 

From the proof, we also obtain 
Theorem 8.9.4 Under the assumptions of Theorem 8.9.2, 

e(f)(x)< 7 o(l+^) J e(f)(y) = wO- + 

B(x,p) 

where 70 depends on the same quantities as Co in Theorem 8.9.2. 

Proof. From (8.9.36), with p' instead of p and also assuming B(x,p') C M, 
we obtain 

e (f)(x)<c 7 ( 1 +^ 2 ) J d 2 ~ m (x,y)e(f)(y)dy. (8.9.38) 

B(x,p') 


We put 

gi(y,z) := d 2 ~ m (y,z ) 

9k(y,z) ■■= J g k -i(y,w)g 1 (z,w)dw 

B(z,p') 


We observe that 

9k(y,z) < c m d 2k ~ m (y,z). 

For example, for k = 2 

92 (v,z)= J d 2 - m (y,w)d 2 ~ m (z,w)dw. 

B(z,p') 


We split this integral into integrals over the regions 

/ := {w : d(y,w ) < ^d(y,z)} 

II := {w : d(z, w) < ^ d{y,z )} 
III := B(z, p')\I U II. 
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Then 

*(»,,)</+/+/ 

I II III 

< Cmd 4 ~ m {y,z), 

as desired. In particular, g k fy,z) is bounded for k > ™. We now iterate 
(8.9.38); this means that we estimate e(f)(y) in the integral on the right 
hand side of (8.9.38) by using (8.9.38) for x instead of y. We need then 
B(y,p') C M, and so we choose p' = £ to guarantee this condition also for 
the subsequent steps. 

After at most ™ steps, we obtain the desired estimate. □ 


Perspectives. The literature on the regularity of harmonic maps has become too 
numerous and extensive to be reviewed here. (See, however, the Perspectives on 
§ 8.10 for some references.) 

Therefore, in this §, I have rather tried to present a representative sample of 
techniques from geometric analysis. The proof of Theorem 8.9.1 given here is due to 
Lin[169]. I have selected that proof because it employs a fundamental tool, namely 
Moser’s Harnack inequalities, in a particularly elegant and geometrically instructive 
manner. The alternative proof is taken from Jost[136]; it is the most general and 
powerful regularity proof presently known; in particular, in contrast to the preced- 
ing proof, it does not utilize a compactness argument in the target. (The telescoping 
argument Lemma 8.9.7 is originally due to [87]; the Harnack inequalities for the 
mollified Green function can be replaced by a more elementary geometric argu- 
ment, see [126].) The proof of Theorem 8.9.2 given here is taken from Jost[135[. I 
have selected that proof because it elucidates the interplay between the geometric 
meaning of the energy functional and its approximations and the geometric features 
of nonpositively curved manifolds. Finally, the proofs of Theorems 8.9.3 and 8.9.4 
(variants of results of Eells and Sampson[70[) have been developed here because of 
their elementary nature, depending only on the geometry of the distance function 
as described in § 4.7. 


8.10 Harmonic Maps into Manifolds of Nonpositive 
Curvature: Uniqueness and Other Properties 

The results of §§ 8.8, 8.9 can be summarized as 

Theorem 8.10.1 Let M and N be compact Riemannian manifolds, N of 
nonpositive sectional curvature. 

Let g : M — > N be a continuous map. Then g is homotopic to a smooth 
harmonic map f, and f can be obtained by minimizing the energy among 
maps homotopic to g. 
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The existence result was deduced from a convexity property of the energy 
functional E. That convexity also suggests a uniqueness result for minimizers 
of E. Here, we shall present a variant of such a reasoning that applies to all 
harmonic maps and shows that they are in fact all minimizers of E. 

Theorem 8.10.2 Let M be a compact, N a complete Riemannian manifold. 
We assume that N has nonpositive curvature. Let fo, fi ■ M N be homo- 
topic harmonic maps. Then there exists a family ft : M — > N, t £ [0,1], of 
harmonic maps connecting them, for which the energy E(f t ) is independent 
of t, and for which every curve 7 x (f) := ft(x) is geodesic, and || J^7 x (f)|| is 
independent of x and t. If N has negative curvature, then fo and fi either 
are both constant maps, or they both map M onto the same closed geodesic, 
or they coincide. 

If M is a compact manifold with boundary, and if fo\OM = fi\dM> then 
again /o = fi ■ 

Proof. We let 

H : M x [0, 1] -► N 

be a homotopy between fo and f\, with fixed boundary values if dM ^ 0. 
In particular H(x, 0) = f 0 (x),H(x, 1) = fi(x). We let 7 x (t) be the geodesic 
arc homotopic to the arc H(x,t). By Lemma 4.8.1, j x (t) is unique. Again, 
t £ [0,1], and of course j x (t) is parametrized proportionally to arc length, 
and we put f t (x) := 7 x (t). 

By Corollary 8.7.1, since N has nonpositive curvature, 

= J (II V J^ 7 x(t)|| 2 (8.10.1) 

M 

d d 

- trace M < R N (df t , ^7*(*))^7*W><#t)) > 0 

Since f t E{f t )\ t= o = 0 = f t E{f t )\ t=1 , we obtain 

E(f t ) = const. (8.10.2) 

From (8.10.1) then V|^7 x (t) = 0; hence ^7 x (t) is also constant in x. If 
dM ^ 0, hence j^T x {t) = 0 for all x, since this is true for x £ dM\ hence 
fo = fi in this case. 

One also sees that fo and fi and hence by (8.10.2) all maps f t are energy 
minimizing in their homotopy class, hence all harmonic. We also get from 
(8.10.1), (8.10.2), by the nonpositivity of the curvature of N 

{R N (df t , ^7 x{t))^) x (t),df t ) = 0. 

If N has negative sectional curvature, then either ^7 x (t) = 0 in which case 
again fo = fi, or rank® df t (x) < 1 for every x, so that ft is constant or maps 
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M onto the geodesic 7 x (t). If d M = 0, the image of M under / 0 and f\ in 
this case has to be a closed geodesic. □ 

From Theorem 8.10.1 and Corollary 8.7.3, we obtain 

Corollary 8.10.1 Let N be a compact manifold of nonpositive sectional cur- 
vature. 

Then every map from a compact manifold with positive Ricci curvature 
into N is homotopic to a constant map. Every map from a compact manifold 
with nonnegative Ricci curvature, in particular from a flat manifold into N is 
homotopic to a totally geodesic map. If the sectional curvature of N is even 
negative, then any such map is homotopic to a constant map or a map onto 
a closed geodesic. □ 

An implication of Corollary 8.10.1 is that manifolds of positive Ricci cur- 
vature are topologically very different from those of nonpositive sectional 
curvature. 

We shall now prove Preissmann’s Theorem. 

Corollary 8.10.2 Let N be a compact Riemannian manifold of negative 
sectional curvature. Then every abelian subgroup of the fundamental group is 
infinite cyclic, i.e. isomorphic to Z. 

Proof. Let a, (3 € 717 (IV, xq). Thus a and (3 are represented by closed loops 
with base point Xo- If a and [3 commute, the homotopy between a/3 and (3a 
induces a map g : T 2 —> N, where T 2 is a two-dimensional torus, i.e. 


9 ■ [0,1] x [0,1] N 

(3 with g(s, 0) = g(s , 1) = a(s) for all s 

9(0, t) = g(l,t) = (3{t) for all t, 


and in particular g( 0, 0) = g( 1, 0) = g(0, 1) = <7(1 , 1) = Xq, since a, (3 have 
base point Xo . 

By Theorem 8.10.1, g is homotopic to a harmonic map 

/ : T 2 -> N. 

During the homotopy between g and /, the base point may change, but of 
course the two loops corresponding to a and (3 will always have the same 
base point at each step of the homotopy. 

Since N has negative sectional curvature, by Corollary 8.7.1, f(T 2 ) is 
contained in a closed geodesic 7, with base point x± = /( 0, 0). Therefore, our 
two loops in 7 Ti( 7V, 27) (the ones obtained from a and (3 through the homotopy 
from g to /, i.e. the curves /( 0, •) and /(• , 0)) are both multiples of 7. Thus 


a 


(3 


a 
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they are contained in a cyclic subgroup of This cyclic subgroup 

has to be infinite as otherwise for some k £ N would be homotopic to a 
constant loop (representing the trivial element of 7Ti(iV, aq)), contradicting 
uniqueness of geodesics (Lemma 4.8.1), as is a geodesic since 7 is, and of 
course a constant loop is also geodesic. 

Thus, the subgroup of Xo) generated by a and /3 is isomorphic to 
an infinite cyclic subgroup. This is true for any two commuting elements in 
ni(N,xo), and the conclusion follows. □ 


Perspectives. While the concept of harmonic maps had been introduced earlier 
by Bochner[24], the insight that led to the existence theorem 8.10.1, namely that 
nonpositive target curvature leads to a useful differential inequality via a Bochner 
formula was obtained by Al’ber[4, 5] and Eells and Sampson[70]. Once this had been 
noted, one could essentially apply the linear argument of [179] to obtain regularity 
and existence of harmonic maps with values in manifolds of nonpositive curvature. 
In fact, Al’ber[5] also showed uniqueness (Theorem 8.10.2) and already conceived 
the general scheme of applying harmonic maps to the investigation of the topology 
of manifolds of nonpositive curvature; in particular, he was the first to derive Preis- 
mann’s theorem from a harmonic map identity. Thus, his work is one of the several 
instances encountered in this book when mathematicians in the former Soviet Union 
obtained results that were not given credit in the Western countries, sometimes from 
ignorance, but sometimes also deliberately. Hartman[112] also obtained the unique- 
ness result for harmonic maps into manifolds of nonpositive curvature. For the case 
of manifolds with boundary, such results were obtained by Hamilton [108]. These 
authors used a parabolic method. They considered the so-called heat flow, i.e. the 
problem 

/ : M x [0, 00 ) — + N 
|£(M) =r(f{x,t)) 

(where the tension held r is taken w.r.t. the a;-variable on M ) 

/(*, 0 ) = g{x). 

They showed that a solution exists for all t > 0, and as t —> 00 , f(x, t) converges to 
a harmonic map homotopic to g. This needs parabolic analogues of the estimates 
of Theorem 8.9.3, Theorem 8.9.4. A detailed and simplified presentation of this 
approach is given in [126]. Elliptic methods were first introduced into harmonic 
map theory by Hildebrandt, Kaul, and Widman[117, 118]. 

Hildebrandt-Kaul-Widman[118] were also able to handle positive image curva- 
ture. They solved the Dirichlet boundary problem for harmonic maps with values 
in a ball B(p,p) in some Riemannian manifold N, with p < min (i(p), where 

k > 0 is an upper bound on the sectional curvature of N. 

The proof allows an important simplification by a result of Kendall[155]. He 
constructed suitable convex functions on such a ball. Such geometric constructions 
adapted to positive curvature had earlier allowed Jager, Kaul[125] to show that the 
solution for the harmonic Dirichlet problem in such a ball is unique. See also [126] 
for a presentation of these results. 

Hildebrandt-Kaul-Widman[118] also discovered that without that convexity 
condition on the target ball, critical points of the energy can be discontinuous, 
and they found the basic example of a singularity, namely the map 
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/ : B(0, 1)(C K") — > S" -1 

X 

x 1 > 1 j" 

1*1 

For n > 3, it has finite energy and is a critical point for the energy. 

Schoen and Uhlenbeck[215, 216] and in a somewhat different context also Gi- 
aquinta and Giusti[87, 88] then developed a regularity theory for energy minimizing 
maps. They discovered that the above example is the prototype of a singularity, 
that energy minimizing maps are regular except possibly on set of Hausdorff di- 
mension at most dim M — 3 and that singularities can be precluded if there are 
no nontrivial energy minimizing harmonic maps from a sphere S k (k > 2) into the 
target. Note that in the above example, for r > 1 ,/(-) defines a harmonic map 
from the sphere S'” -1 into S'" -1 . In the general case of a singularity, the same has 
to happen at least in the limit r — > oo. For a detailed account of the theory and its 
subsequent developments, we recommend Steffen[230]. 

Returning to nonpositive image curvature, as mentioned above, Al’ber[5] was 
the first to observe that harmonic maps can be used to prove Preissmann’s theorem. 
Extensions of Preissmann’s theorem, i.e. further restrictions on fundamental groups 
of compact manifolds of nonpositive curvature, were found by Yau[262], Gromoll 
and Wolf[97], Lawson and Yau[167]. The harmonic map approach to these results 
is presented in [129]. Recently, a general theory of harmonic maps between metric 
spaces has been developed. A systematic description, together with the appropriate 
references, can be found in Jost[137]. 

We now want to discuss some further results about harmonic maps and their 
applications. 

The first topic are so-called harmonic coordinates. Let M be an n-dimensional 
Riemannian manifold. Local coordinates are diffeomorphisms from an open subset 
U of M onto an open subset of R d . They are called harmonic if the coordinate 
functions are harmonic. Harmonic coordinates have been employed in general rela- 
tivity. They were introduced into Riemannian geometry by Sabitov and Shefel’[208] 
and by de Turck and Kazdan[54] by showing that the metric tensor when written 
in harmonic coordinates has the best possible regularity properties. (In particular, 
the regularity properties are better than those of normal coordinates.) Explicit es- 
timates were developed parallely and independently by Jost and Karcher[141] and 
Nikolaev[192]. The precise result of Jost-Karcher is 


Theorem. Letp £ M. There exists Ro > 0, depending only on the injectivity radius 
of p, the dimension n of M, and a bound A for the absolute value of the sectional 
curvature on B(p, Ro) with the property that for any R < Ro, there exist harmonic 
coordinates on B(p,R) the metric tensor g = (gtj) of which satisfies on each ball 
B(p, (1 — 5)R ) for every 0 < a < 1 


Iffl 


C 1 '" T 


< c(AR 0 ,n,a) A 2 R 2 


8 2 


(Here, the norm is the usual one of the Holder space C 1,a .) In particular the a- 
Holder norms of the Christoffel-symbols are bounded in terms of ARo and n. 


(See also the presentation in [126].) It is easy to construct harmonic functions on 
balls B(p, Ro). A difficult point, however, is to construct n harmonic functions that 
furnish an injective map of maximal rank into 1R". This is the main achievement 
of the preceding result. As an application, one obtains C 2 ’“ estimates for harmonic 
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maps between Riemannian manifolds, depending only on the dimensions, injectiv- 
ity radii and curvature bounds of the manifolds involved provided one knows an 
estimate for the modulus of continuity of the maps already. (Otherwise, no esti- 
mate can hold, see Theorem 8.2.2). These estimates were also the crucial tool for 
the proofs of the Gromov compactness theorem (see Short survey on curvature and 
topology, above). 

We already described in the Perspectives on § 4.8 how to define a notion of a 
metric space of nonpositive curvature. Now, by an extension of the construction 
presented in § 8.4, one may define an energy integral for maps between metric 
spaces as a generalization of the energy integral in the Riemannian case considered 
here. Again, it turns out to be expedient not to work with maps between compact 
spaces as we did in this §, but rather to lift to their universal covers and con- 
sider equivariant maps. Thus, let X and Y be metric spaces with isometry groups 
J(A') and I(Y), resp., F a (typically discrete) subgroup of I(X),p : F — > I(Y) a 
homomorphism. We then call / : X — > Y p-equivariant if 

fi'yx) = p{l)f(x) for all x € X, 7 € r. 

Of course, if M and N are compact Riemannian manifolds with fundamental groups 
7 ti(M) and m(N), resp., then these groups operate by deck transformations on the 
universal covers A' := M, Y := N, and a homotopy class of maps from M to N 
defines a homoporphism 


p : 7Ti(M) — > m(N) C I{Y), 

and the lift of any map in that homotopy class to the universal covers then has to 
be p-equivariant. In fact, if IV is a so-called n(n, l)-space, meaning that all higher 
homotopy groups nk(N),k > 2, are trivial (such an N is also called aspherical, 
because that means that every continuous map ip : S k —> N, k > 2, is homotopic to 
a constant map), then conversely the push down of any p-equivariant map lies in the 
homotopy class defining p. This device, namely to work with p-equivariant maps, 
among other things, has the important advantage that it also naturally applies in 
situations where some of the elements 7 and p( 7) have nontrivial fixed points, i.e. 
where the spaces X/F and/or Y/ p(F) may have singularities. The energy of a p- 
equivariant map then is simply defined by integration over a fundamental region 
of r in A. Minimizers are called generalized harmonic maps. The key feature of 
the assumption of nonpositive curvature then is that it makes the energy integral 
a convex functional on spaces of p-equivarant, square integrable maps as in § 8.8. 

As already indicated, this works in considerable generality, and in fact, such 
generality is useful for example in the context of superrigidity discussed below 
where certain metric spaces of nonpositive curvature that are quite far from be- 
ing manifolds naturally occur. Some of those spaces even are not locally compact 
anymore. 

A theory of such generalized harmonic mappings has been developed by J. 
Jost[133, 134, 135, 137] and independently (but under more restrictive assumptions, 
like local compactness) by Korevaar and Schoen[162]. In fact, a key point of the 
approach of Jost is that the convexity of the functional can compensate the lack 
of local compactness of the target in existence proofs. (Subsequently, Korevaar and 
Schoen[163] reproved a special case of those existence results by a variant of the 
method of Jost.) Actually, still more generality can be achieved, and new light can 
be shed on why nonpositive curvature is the fundamental assumption for harmonic 
maps. Namely, a space of p-equivariant, square integrable maps into a space of 
nonpositive curvature is itself a space of nonpositive curvature (of course, not locally 
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compact anymore even if the original target had been locally compact), and the 
existence of generalized harmonic maps can then be deduced from an existence 
theorem for minima of convex functionals on spaces of nonpositive curvature. In 
fact, we have displayed this existence method in § 8.8 in the setting of a Riemannian 
target. For a comprehensive treatment, we refer to [137]. 

We finally want to discuss the applications of harmonic maps to superrigidity 
results (see the Perspectives on 5.5). 

As explained in the Perspectives on 8.7, Siu derived a Bochner type identity for 
harmonic maps between Kahler manifolds. If the image has nonpositive curvature 
in a suitable sense, it implies that the product of the Hessian of the map with the 
Kahler form of the domain vanishes, or in other words, that the map is plurihar- 
monic. A detailed study of the curvature tensors of Hermitian symmetric spaces 
(i.e. those that are Kahler) of noncompact type then allowed him to conclude that a 
harmonic homotopy equivalence between compact quotients of such spaces is holo- 
morphic or antiholomorphic. It then also is a diffeomorphism. If the domain is also 
a quotient of a Hermitian symmetric space, one can then show that the map is an 
isometry, proving Mostow’s theorem in the Hermitian case. It is interesting to note 
that the curvature terms to be investigated here come from the image and not from 
the domain. Sampson[213] found a different formula that applies to harmonic maps 
from Kahlerian to Riemannian manifolds. Corlette[52] showed that the product of 
the Hessian of a harmonic map with any parallel form on the domain vanishes if 
the image has nonpositive curvature. For quotients of quaternionic. hyperbolic space 
and the hyperbolic Cayley plane this allowed him to conclude that the Hessian itself 
vanishes, i.e. that a harmonic map from such a quotient into a nonpositively curved 
manifold is totally geodesic. This again implies a rigidity theorem. 

If one wants to derive so-called nonarchimedean superrigidity and arithmeticity 
of lattices (see Perspectives on 5.5), one has to study homomorphisms of lattices 
into Sl(n,Qp)(Q p = p - adic numbers). It turns out that this group operates on a 
so-called Tits building, a certain simplicial metric space with nonpositive curvature 
in the sense of Alexandrov. Gromov and Schoen[104] then developed a theory of 
harmonic maps from Riemannian manifolds into such spaces. In particular, they 
could extend Corlette’s results to the p-adic case and obtain arithmeticity of the 
corresponding lattices. 

The most general superrigidity results for harmonic maps were obtained by Jost 
and Yau[148] and Mok, Siu and Yeung[186]. Since the image of a lattice need not 
be a lattice anymore, once more, one has to work with p-equivariant maps. 

The result then is that any such harmonic map is totally geodesic, i.e. we have 

Theorem. Let M = G/K be an irreducible symmetric space of noncompact type, 
other than SO Q (p, l)/SO(p) x SO(l), SU(p, l)/S(U(p) x U(l)). 

Let F be a discrete cocompact subgroup ofG (i.e. a cocompact lattice). Let N be a 
complete simply connected Riemannian manifold of nonpositive curvature operator 
with isometry group I(N). Let p : F — > I(N) be a homomorphism for which p(T ) 
either does not have a fixpoint on the sphere at oo of N or if it does, it centralizes 
a totally geodesic flat subspace. Then there exists a totally geodesic p-equivariant 
map, 

f : M ->■ N . 


(With the method of Mok- Siu- Yeung, the curvature assumption on N can be 
weakened; if M = G/K is of rank > 2 then it suffices that N has nonpositive 
sectional curvature.) 
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The proof follows from a careful choice of the parameter A in the Bochner 
formula of the Perspectives on 8.7 and a detailed study of the curvature tensors of 
symmetric spaces. 

The corresponding result for SOo(p, 1)/ SO(p)x SO(l) and SU(p, 1)/ S(U(p)x 
U(l)) is false, because it would imply that compact quotients have vanishing first 
Betti number and there are examples of compact quotients of these spaces for which 
this is not the case. In the case of SU(p, 1)/ S(U(p)x U(l)), one gets, however the 
existence of a pluriharmonic p-equivariant map, essentially a special case of the 
result of Siu quoted above. 

For Sp(p, 1)/ Sp(p)x Sp(l) and the hyperbolic Cayley plane, the result is Cor- 
lette’s theorem quoted above. For Hermitian symmetric spaces, the result is due to 
Mok[185, 184], 

A consequence of the theorem is 

Corollary. Let M = G/K and F be as above. Let H be a semisimple noncompact 
Lie group with trivial center, p : F — > H a homomorphism with Zariski dense image. 
Then p extends to a homomorphism from G onto H. 

As explained above, this result is due to Margulis for rank (G/K) > 2 and to 
Corlette for Sp(p, 1)/ Sp(p)x Sp(l) and the hyperbolic Cayley plane. 

Using the constructions of Gromov and Schoen, the result extends to the nonar- 
chimedean case to show 

Theorem. Let M = G/K and F be as above. 

Let p : r — > Sl(n, Q p ) be a homomorphism, for some n £ N and some prime p. 
Then p(F) is contained in a compact subgroup o/Sl(n, Q p ). 

As explained above, the result is again due to Margulis for rank (G/K) > 2, 
and to Gromov-Schoen for quaternionic hyperbolic space and the hyperbolic Cayley 
plane. 

The harmonic map approach to rigidity is still not complete: 

First of all, so far it has been unable to derive Mostow’s rigidity theorem for 
quotients of real hyperbolic space. Secondly, the results for spaces that are of fi- 
nite volume but not compact (i.e. for nonuniform lattices) are still not complete. 
Margulis’ results, for example, also hold in the noncompact case. (For rank 1, rigid- 
ity results were shown earlier by G. Prasad.) In the Hermitian symmetric case, 
however, this problem was solved by Jost and Zuo[149]. 

A new and very interesting approach to rigidity that applies particularly well 
in the case of real hyperbolic spaces has been developed by Besson, Courtois and 
G allot [20, 21], 

One open problem that is quite easy to formulate but as yet unsolved is the 
following one of H. Hopf: Let M 2m be a compact manifold of even dimension 2m 
that admits a Riemannian metric of nonpositive sectional curvature. Is it then true 
that the Euler characteristic of M satisfies 

(-l) m x(M 2m ) > 0 

(with strict inequality in the case of negative sectional curvature)? So far, this 
has only been demonstrated under additional conditions, e.g. that the curvature 
is pinched between two negative constants, see for example Donnelly, Xavier[62], 
Bourguignon, Karcher[28], Jost, Xin[145]. If the manifold carries a Kahler metric, 
then this conjecture has been verified by Gromov[100], in the case of negative 
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sectional curvature, and by Jost, Zuo[150] and Cao, Xavier[37] in the nonpositive 
case. 


Exercises for Chapter 8 


1) Show that every two-dimensional torus carries the structure of a Rie- 
mann surface. 

2) Determine all harmonic maps between tori. 

(Hint: Use the uniqueness theorem and the fact that affine linear maps 
between Euclidean spaces are harmonic.) 

3) Determine all holomorphic quadratic differentials on a two-dimensional 
torus, and all holomorphic quadratic differentials on an annular region 
{z £ C : 7*i < \z\ < V 2 } (0 < n < r 2 ) that are real on the boundary. 

4) Show that the conclusions of the Hartman-Wintner-Lemma 8.2.6 con- 
tinue to hold if (8.2.17) is replaced by 

\u zz | < K(\u z \ + |u|). 


5 ) 


We let £ be a Riemann surface and H : M 3 — > R. be a smooth function. 

For a map / : £ — > R 3 we consider the equation 

(|s + = 2/f (/(*))/, A /„ 

where z = k + iy is a conformal parameter on £ and A denotes the 

standard vector product in R 3 . 

a: Show that, if / is conformal, H(f(z)) is the mean curvature of 

the surface f{£) at the point f(z). 

b: If £ = S' 2 , show that every solution is conformal. 

c: If £ is the unit disk D and / is a solution which is constant on 

dD, show that it is constant on all of D. 

d: Show that for a nonconstant solution, f x and f y have only 

isolated zeroes. 

e: At those points where f x and f y do not vanish, we define 


( fxx 1 fx A fy) 

1/xA/^l 

(fxy, fx A fy) 
\fx A fy I 
(fyy> fx: A fy) 
\fx A fy\ 
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(using the Euclidean metric of R 3 ). 

Show that for a solution with H = const., i pdz 2 := (L — N — 
2iM)dz 2 is a holomorphic quadratic differential. 

Conclude that <p, since holomorphic and bounded, extends to 
all of £ as a holomorphic quadratic differential. 

f: If H = const, and £ = S 2 show that every solution f(£) has 

constant and equal principal curvatures at each point. Conclude 
that it is a standard sphere of radius i.e. f(£) = {x € R 3 : 

\x — xo| 2 = jj} for some Xo- 

(Hint: Use a), b), e) and Lemma 8.2.4.) 

Remark: By the uniformization theorem, every two dimensional 
Riemannian manifold M diffeomorphic to S 2 admits the struc- 
ture of a Riemann surface and a conformal diffeomorphism 
K : S 2 — > M. It thus is conformally equivalent to S 2 . The 
exercise then implies that every surface diffeomorphic to S 2 
and immersed into R 3 with constant mean curvature is a stan- 
dard “round” sphere. This result, as well as the method of proof 
presented here, were discovered by H. Hopf. 

6) Prove Theorem 8.3.3, assuming only that N is complete but not ne- 
cessarily compact. 

7 ) 

a: We call a closed subset A of a Riemannian manifold N convex 

if any two points in A can be connected by a geodesic arc in 
A. We call A strictly convex if this geodesic arc is contained in 
the interior of A with the possible exception of its endpoints. 
We call A strongly convex, if its boundary dA is a smooth 
submanifold (of codimension 1) in N and if all its principal 
curvatures w.r.t. the normal vector pointing to the interior of A 
are positive. Show that a strongly convex set is strictly convex. 

b: Show that a strongly convex subset A of a complete Riemannian 

manifold N has a neighborhood whose closure B\ and B 0 := A 
satisfy the conclusions of Lemma 8.3.2. 

c: Show that Theorem 8.3.1 continues to hold if N is only com- 

plete, but not necessarily compact, again with ^(N) = 0, pro- 
vided p(£) is contained in a compact, strongly convex subset 
A of N. In that case, the harmonic f : £ — > N also satisfies 
f(£) c A. 

8) In this exercise, still another definition of the Sobolev space H 1,2 ( M , N) 
will be given. The embedding theorem of Nash (see the Perspectives 
on 1.4) implies that there exists an isometric embedding 

i : N -> R fc 



into some Euclidean space. 
We then define 
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H]’ 2 {M, N) := {/ G : f(x) G i(N) 

for almost all x G M } . 

Show that 

H 1,2 (M, N) = h]' 2 (M,N). 

(Hint: Theorem 8.7.1 implies that = iJ- 1,2 (M, R fe ) since 

every map into R fc is localizable.) 

9) 

a: For 1 < p < oo and / G L p ( M, N), we define 

e pAI) : = [ [ d p (f(x),f(y))dvo\(y)dvo\(x) 

0J m £ m+p J M JB(x,e) 

(with the same notation as in (8.7.1)), and 

E p (f) := lim E pe (f) GlU {oo} 

£ — ^0 

(show that this limit exists). We say that / G L p ( M, N) belongs 
to the Sobolev space H 1 ’ P (M, N) if E p (f) < oo. Characterize 
the localizable maps belonging to H 1,P (M, N). 

b: Show lower semicontinuity of E p w.r.t. L p -convergence, i.e. if 

(AO ,/eN converges to / in L P (M, N), then 

E p (f) < lim inf E p (f v ). 

is—> OO 

c: Derive the Euler-Lagrange equations for critical points of E p . 

(The smooth critical points are called p-liarmonic maps. The 
regularity theory for p-harmonic maps, however, is not as good 
as the one for harmonic maps. In general, one only obtains 
weakly p- harmonic maps of regularity class C 1,Q for some a > 
0 .) 

d: Show the existence of a continuous weakly p-harmonic map 

(minimizing E p ) under the assumptions of Theorem 8.3.1. 

e: Extend the existence theory of § 8.8 to E p . 

10) Derive formula (8.7.13) in an invariant fashion, i.e. without using local 
coordinates. 

11) Prove the following result that is analogous to Cor. 8.7.4. A smooth 
map f : M —> N between Riemannian manifolds is totally geodesic 
if and only if whenever V is open in N, U = / -1 (P), h : V — > R is 
convex, then ho f : U — > R is convex. 
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Let M be a compact Riemannian manifold with boundary, N a Rie- 
mannian manifold, / : M — > N harmonic with f(dM) = p for some 
point p in N. Show that if there exists a strictly convex function h on 
f(M) with a minimum at p , then / is constant itself. 

State and prove a version of the uniqueness theorem 8.10.2 for mini- 
mizers of the functionals E e . Show that, as for the energy functional 
E, any critical point of E s (with values in a space of non-positive 
sectional curvature, as always) is a minimizer. 



Appendix A: Linear Elliptic Partial 
Differential Equations 


A.l Sobolev Spaces 

We are going to use the integration theory of Lebesgue. Therefore, we shall 
always identify functions which differ only on a set of measure zero. Thus, 
when we speak about a function, we actually always mean an equivalence 
class of functions under the above identification. In particular, a statement 
like “the function / is continuous” is to be interpreted as “/ differs from a 
continuous function at most on a set of measure zero” or equivalently “the 
equivalence class of / contains a continuous function” . 

Replacing functions by their equivalence classes is necessary in order to 
make the L p - and Sobolev spaces Banach spaces. 

Definition A. 1.1 17 C open, p £ R, p > 1, 

L p {fi) := |/ : 17 — > R. U {±oo} measurable 

and ||/|| L P (fi ) : = (y \f(x)\ P dxj ” < ooj 
n 

L°°(l7) := j/ : 17 — > R U {±oo} measurable 

and || /|| z,°°(r 2 ) == ess sup|/(a;)| < oo}, with 

ess sup f(x) := inf l a € KU {oo} : f(x) < a for almost all x € 17 >. 
xeo 1 ' -* 


Theorem A. 1.1 L p (l7) is a Banach space for 1 < p < oo (with norm 
II • \\ LP ({!))■ 


Theorem A. 1.2 (Holder’s inequality). Let p, q > 1, ^ ^ = 1 {q = oo for 

p = 1 and vice versa), f £ L p (f2),g £ L 9 (l7). Then fg £ L 1 (l7) and 


J I f(x)g(x)\dx< 
n 
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More generally, for p!,... p m > 1 + + = 1 , fi G L Pi (I2),i = 1 


i j ./■/(*'•) < n 


i = 1 


\fi(x)\ p 


1 

Pi 


Theorem A. 1.3 If (/„)„ e n converges to f in L p (f2), then a subsequence 
converges pointwise almost everywhere to f. 

Theorem A. 1.4 Co°(l?) is dense in L P (I2) for 1 < p < oo (but not for 
p = oo). 

Theorem A. 1.5 If f G L 2 (I2) and J f(x)ip(x)dx = 0 for every <p G Cq°(I2), 

n 

then f = 0. 

L p oc {f2) ■- {/ : n -* R U {±oo} : / G L p (tf) for V 12' <g 12} 

Definition A. 1.2 Let / G Lj oc (I2). We call v G L( oc (I2) the weak derivative 
of / in the direction of x l , v = Dif , if 

J v(x)tp(x)dx = - J f(x) dx 
n n 

for all ip G Cq ( f2). Here x = (; x 1 , . . . , x n ) G K". 

Weak derivatives of higher order are similarly defined (notation D a f for 
a multiindex a). 

Definition A. 1.3 k G N, 1 < p < oo. We define the Sobolev spaces and 
Sobolev norms as follows: 

W k ’ p (f2) := {/ G L p (f2) : Vet with \a\ < k : D a f G L p (fl)} 

\\f\\w«.p(n) ■= ( [ \ D «f\ P J for 1 < p < oo 

\M<*n / 

ll/llw.«(«) := 51 ess sup \D a f(x)\ 

\a\<k aGO 

Hq ,p (I2) := closure of Cq°(I2) w.r.t. || • || w k -p(n) 

H k,p (f2) := closure of C°°(Q) w.r.t. || • \\w k -v(n) 


Theorem A. 1.6 W k ’ p (I2) = H k ’ p (I2) for 1 < p < oo, k G N. W k ’ p (f2) is a 
Banach space for 1 < p < oo, k G N. 
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Some local properties of Sobolev functions: 


Lemma A. 1.1 h? C open, f € l),i € {1, . . . , d}. Then for almost 

all AgR, /ijj-i-x} is absolutely continuous. 


Let / £ L 1 (l 7), 17 open in ]R d . Then for almost all Xq £ 12 


lim 

r — »0 


l 

\B(x 0 ,r)\ 


I f(x) - f{x 0 ))\dx = 0 


(|l?(a;o,r)| = Udr d denotes the Lebesgue measure of the ball B(xo,r)). 

An Xo satisfying this property is called a Lebesgue point. If Xq is a 
Lebesgue point, then / is approximately continuous at Xq; this means the 
following: 

For s > 0, let 

S e := {y£ l 7 : \f(y) - f(x 0 )| < e} . 


Then 


lim 

i — >0 


I s e n B(x a ,r) I 
\B{x 0 ,r)\ 


= 1 


for all £ > 0. 


Similarly, / £ 17 1 ’ 1 (17) is called approximately differentiable at Xo £ 17, with 
approximate derivative V/(x o), if for 


Si :={y£f2 : \ f(y) - f{x 0 )(y - x 0 ) - V/(a:o)| < e\y - cc 0 |} 


We then have 


lim 

r — »0 


\Sl nB(x 0 ,r)\ 
\B(x 0 ,r)\ 


= 1 


for all £ > 0. 


Lemma A. 1.2 A function f £ 17 1 ’ 1 (17),17 C open, is approximately 
differentiable almost everywhere, and the weak derivative coincides with the 
approximate derivative almost everywhere. 

Lemma A. 1.3 17 C open, £ : R — > R. Lipschitz, f £ H 1 ’ p (f2). If to f £ 
L p (f2), then to f £ H 1,P (I2) and for almost all x £ 12, 

Bi(£ o f)(x) = e'(f(x))D i f(x), i = l,...,d. 


Theorem A. 1.7 (Sobolev embedding theorem). 17 C R" open, bounded, f £ 
Hq P (I 2). Then 

np 

f £ L for p < n 
f £ C°(I2) for p > n 
More precisely, 3 constants c = c{n,p) : 
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\\f\\ L ^ (0) < c\\Df\\ LP{0) for p < n 
sup |/0)| < c Vol(f?)" _ r \\Df\\ LP( a) for p > n 

Q 

For n = p, f £ L q (f2) for all q < oo. 

Remark. H 1 ^ n (I2) is not contained in C°(l7) or L°°(f2). 

Let us consider the following example: 
d > 2, L? = B{ 0, i) C R d , /( x) := loglogj^y is in H^ d {Q), but has a 
singularity at x = 0 and is unbounded there. Using this example, we may even 
produce functions in H 1,d with a dense set of singular points. For example, 

O 

take 17 = 1?(0, C R d , let (p„)„eN be a dense sequence of points in 17 and 
consider 

ffO) '■=^2‘2F V f(x - p v ). 

V 

Corollary A. 1.1 (Poincare inequality). 17 C R" open, bounded, 
f G Hg’ 2 (f2) => ||/||z, 2 ( fi) < const. vol(!7)« ||H/|| L 2 (fi) . 


Corollary A. 1.2 17 C R" open, bounded 


H k 0 ’ p (f2) C 


£,n-ftp(L?) for kp < n 
C m (JT) for 0 < m < fc — (r 


In particular, if f £ ,p (l7) /or all k £ N and some fixed p, then f £ C°°(l7). 


Theorem A. 1.8 (Rellich-Kondrachov compactness theorem). 17 C R" open, 
bounded. Suppose 1 < q < if p < d, and l<q<oo if p>d. Then 
H 0 ’ p (f2) is compactly embedded in L q (I2), i.e. if (f n )ne N C 7/0(17) satisfies 

Wfn || w 1 ’ p (0) < const., 
then a subsequence converges in L 9 (l7). 

Corollary A. 1.3 17 as before. Then is compactly embedded in 

L 2 (17). 


H k ’ 2 (I2) is a Hilbert space, the scalar product is 


(/; d)H k ’ 2 (0) 


O' f D a f(x)D a g(x)dx. 

\ a \<k J n 


Finally, we recall the concept of weak convergence: 
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Let H be a Hilbert space with norm || • || and a product (•,•). Then 
(vn)neN C H is called weakly convergent to v € H , 

v n -r v, 


iff 

(v n ,w) — > (v,w) for all w £ H. 

Theorem A. 1.9 Every bounded sequence (w n )neN H contains a weakly 
convergent subsequence, and if the limit is v, 

IMI < liminf ||t>„|| 

n—> oo 

(where (v n ) now is the weakly convergent subsequence). 

Example. Let (e n ) be an orthonormal sequence in an infinite dimensional 
Hilbert space. Then e n —r 0. In particular, the inequality in Theorem A. 1.9 
may be strict. 


A. 2 Existence and Regularity Theory for Solutions of 
Linear Elliptic Equations 

17 will always be an open subset of R m . 

For technical purposes, one often has to approximate weak derivatives if 
they are not yet known to exist by difference quotients which are supposed 
to exist. Thus, let 

f£L 2 (f2,R), 

(ei, . . . , e m ) an orthonormal basis of M m 
h £ M, h 0. 

We put 

A if( x ) '■= + ^ (if dist(a;,ai7) > \h\). 

If tp £ L 2 (17),supp <p (s 17, \h\ < dist(supp (p,df2), we have 

J {Aif(x))<p(x)dx = - j f(x)A~ h ip(x)dx. (A. 2.1) 

Q Q 

Lemma A. 2.1 If f £ 77 1,2 (17), 17' <g 17, \h\ < dist(I2' , d 17) , then Abf £ 
L 2 (l7') and 


W^fh^a') < \\Dif\\ L 2 {n) for i = 1 
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Conversely 

Lemma A. 2. 2 If f £ L 2 (f2) and if for some K < oo 

IIA- B /IU’(flo < K 

for some sequence h n —> 0 and all 12' (e 12 with h n < dist(f2 ' , df2), then the 
weak derivative Dif exists and 


\\DJ\\ LH o) < K. 


The fundamental elliptic regularity theorems for Sobolev norms may be 
proved by approximating weak derivatives by difference quotients. 

We now formulate the general regularity theorem. 

We consider an operator 

£/(*) = (A.2.2) 

for x £ f2, f : Q -> R, 12 C K m . 

We assume that there exist constants 0 < A < p with 

A|C| 2 < a af3 (x)Z a Zts < p,|£| 2 (A. 2.3) 

for all x £ 12,1; £ R m . We say that L is uniformly elliptic. Let k € L 2 {f2). 
Then / £ H 1,2 (S 7) is called weak solution of 


Lf = k 


if 


J a a/3 (x)Di3f(x)D a ip(x)dx = ^J k(x)ip(x)dx (A. 2. 4) 

n n 


for all ip £ Hq' 2 (I2). 


Theorem A. 2.1 Let f £ H 1 ’ 2 (I2) be a weak solution of (A. 2. f). Suppose 
k £ H u ’ 2 (12) , a a @ £ C' +1 (I2) (v £ N). 

Then 

f £ H v+2,2 (T2') 


for every 12' <£ f2. 

If 


ll aQ/3 ||c‘ , + 1 (n) < Kv 


then 

ll/lli/"+ 2 ’ 2 (J? / ) < C (II/IU 2 (17) + INI H"’ 2 ^)), 

where c depends on m, A, v, A'„, and dist(f? / , df2). 


(A.2.6) 



A. 2 Linear Elliptic Equations 537 


The Harnack inequalities of Moser are of fundamental importance for the 
theory of elliptic partial differential equations: 


Theorem A. 2. 2 Let L be a uniformly elliptic operator as in (A. 2. 2), 
(A. 2.3). 

(i) Let u be a weak subsolution, i.e. 

Lu >0 in a ball B{xq,4R) C R m 
(f a a ^D p uDipp < 0 for all ip € Ft^' 2 (B(xq, 4R))) . For p > 1 then 


(ii) 


sup U < Cl 
B(xq,R ) 



,(2 ry 


(max(u(a;), 0 ) p dx) 


B(x 0 ,2R ) 


l 

p 


where Ci depends only on m and If in (A. 2. 3). 
Let u be a positive super solution, i.e. 


Lu <0 in a ball B{xq,4R) C 


For m> 3 and 0 < p < then 


( 


1 


V 


w m ( 2-R) m 



B(x 0 ,2R) ) 


l 

p 


< 


C2 


- P ) 2 


c-i again depending only on m and j. For m = 2 and 0 < p < oo, 
the same estimate holds when , m ° 2 is replaced by a constant C 3 

V m-2 P) 

depending on p and ff . 


The Harnack inequality also translates into estimates for the fundamental 
solutions of the Laplace-Beltrami operator, and their generalizations, the 
Green functions. The Green function G(x q,x) of a ball B C M (or another 
sufficiently regular domain), for xq in the interior of B , is symmetric in x 
and Xo, smooth for x ^ Xo, becomes singular like ( d _^ 0Jd d(x, Xo) 2 ~ d in case 
d = dim M > 3 (u>d = volS' d_1 ) (and like A-logd(xo,x) for d = 2), vanishes 
for x € dB , and satisfies 

h(x 0 ) = f Ah(x) G(xo, x)dvol(x) for all /i € Cq(B). 

Jb 

A geometric approximation of the Green function (that is exact in the Eu- 
clidean case) has been investigated in 4.7. An analytic alternative that allows 
to avoid the singularity is the use of the mollified Green function. For simplic- 
ity, and because that typically suffices for applications, we only consider the 
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case of a ball. The mollified Green function G R (x o, x) on the ball B{x o, R ) rel- 
ative to the ball B(x o, 2 R) of double radius, G R ( xq, •) € B 1,2 nCQ (B(xo, 2 R)), 
satisfies 


Atp(x) G R (xo,x)dvol(x) = 


(dip(x) , dG R (x o, x))dvol(x ) 


B(x 0 ,2R ) B(x 0 ,2i?) 

= j- tp(x)dvol(x) 

B(x 0 ,R) 

for all (fi € if 1,2 with supp p B(x 0 ,2R). 
For purposes of normalization, it is convenient to consider 


,R/ \ |5(a;o,2i?)| R 


w R { x) := 


i? 2 


G H (x 0 ,x) 


(with \B\ \= voLB). 

We then have 

J {dip(x),dw R (x)) = -^2 J ‘fix) 

B(xo,2R ) B(xq,R ) 

for all <p € H 1 ’ 2 with supp <e B(xq,2R). 


We then have the estimates 


Corollary A. 2.1 

0 < w R < 7 i in B(x o, 2 R) 
w R > 72 > 0 in B(xq, R) 

for constants 71,72 that do not depend on R. 


The estimates of J. Schauder are also very important: 

Theorem A. 2. 3 Let L he as in (A. 2. 2), (A. 2. 3), and suppose that the coef- 
ficients a a h[x) are Holder continuous in (2, i.e. contained in C" T (1?) for some 
0 < a < 1 

(i) If u is a weak solution of 

Lu = k 

and if k is in L°°(f2), then u is in C 1 ' a (f2), and on every I2 0 <e 12, 
its C 1 ’ 7 -norm can he estimated in terms of its L 2 -norm and the L°°- 
norm of k, with a structural constant depending on f2, f2o,m.,a, A,/i 
and the C a -norm of the a a ^(x). 

(ii) If u is a weak solution of 
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Lu = k 

for k £ C u,a (ft), some v = 0, 1, 2, . . . , 0 < a < 1, and if the coeffi- 
cients a a P are also in C I '’ a (ft), then u is in C v+2,a (i ?), and a similar 
estimate as in (i) holds, this time involving the C v ' a -norm of k and 
the a a/3 . 

Finally, we quote the maximum principle. 

Theorem A. 2. 4 Let ft C R m (or, more generally, ft C M,M a Riemannian 
manifold) be open and bounded, f £ C 2 (ft) n C°(ft) with 

Lf > 0 in ft, 

L as in (A. 2. 2), (A. 2. 3). Then f assumes its maximum on the boundary dft. 

All the preceding results naturally apply to the Laplace-Beltrami operator 
on a ball B(xo,r) in a Riemannian manifold M, putting 

£ — 

( 7 a/ 3 )a,/ 3 =i,...,m the metric tensor of M in local coordinates, (7 a/3 ) = (7a/3) -1 > 
7 = det(7 a/3 ). 

References for the material in this appendix are: Gilbarg and Trudinger[89], 
Jost[140] (German version: [138]) and, with a more elementary presentation, 
Jost[139]. The results of Corollary A. 2.1 about Green functions are system- 
atically derived in [106], and in a more general context in [23]. Some further 
points about Sobolev spaces can be found in Ziemer[265]. 
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In this appendix, we briefly list some topological results. We assume that M 
is a connected manifold, although the results hold for more general spaces. 
A path or curve in M is a continuous map 

c : [0, a] — » M (a > 0). 

A loop is a path with c(0) = c(a), and that point then is called the base point 
of the loop. The inverse of a path c is 

c" 1 : [0, a] -> M 
c _1 (f) := c(a — t) . 

If Ci : [0,a 2 ] — > M are paths (* = 1,2) with C2(0) = ci(ai), we can define 
the product c± ■ C 2 as the path c : [0, a\ + 02 ] — > M 

, , _ f ci(t) for 0 < t < «i 

^ \ C 2 (t — a±) for ai < t < ai + a 2 ' 

Two paths Ci : [0,aj] with Ci(0) = C2(0) and ci(ai) = 02 ( 02 ) are called 
equivalent or homotopic if there exists a continuous function 

H : [0, 1] x [0, 1] — ► M 

with 

H(t, 0) = ci( — ), 

01 

H(t, 1) = C 2 (— ) for all t 

02 

H(0,s) = Cl (0) =c 2 (0) 

H( 1, s) = ci(oi) = 02 ( 02 ) for all s . 

In particular, c : [0, a] — > M is equivalent to c : [0,1 ] — > M with c(t) = o(^), 
and so we may assume that all paths are parametrized on the unit interval. 

We obtain an equivalence relation on the space of all paths. The equiva- 
lence class of c is denoted [c], and it is not hard to verify that [ 0102 ] and [c -1 ] 
are independent of the choice of representations. Thus, we may define 
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[Cl • c 2 ] =: [ci] • [c 2 ] 



In particular, the equivalence or homotopy classes of loops with fixed base 
point p £ Af form a group 7Ti(Af,p), the fundamental group of Af with base 
point p. 

If p and q are in Af and 7 : [0, 1] — > Af satisfies 7(0) = p , 7(1) = q, then 
for every loop c with base point q, 7 _1 C7 is a loop with base point p, and this 
induces an isomorphism between 717 ( Af , q) and 7 Ti ( Af , p) . We may thus speak 
of the fundamental group 717 (Af) of Af without reference to a base point. 
M is called simply connected if 717 (Af) = 0. A continuous map / : M — > N 
induces a map /# : 7 Ti(Af,p) — > 717 (TV, f(p)) of fundamental groups. 

A continuous map 

7r : X — > Af 

is called a covering map if each p £ Af has a neighborhood U with the prop- 
erty that each connected component of ir~ 1 (U) is mapped homeomorphically 
onto U. If p £ Af and H is a subgroup of 7Ti (Af, p) , there exists a covering 
7r : X — * Af with the property that for any x £ X with n(x) = p , we have 
7r*(7Ti(X, ar)) = if. 

If we choose H = {1}, we obtain a simply connected manifold Af and a 
covering 

7r : M — > Af. 

Af is called the universal covering of M. 

If 7T : X — > Af is a covering, c : [0, 1] — > Af a path, £0 € 7r _1 (a:(0)), then 
there exists a unique path 

c : [0, 1] -► X 

with c(0) = xq and c(t) = c is called the lift of c through Xq. 

More generally, if Af' is another manifold, / : M' — » Af is continuous, 
Po G Af, 2/0 G / _1 (Po)) ;z: o G 7r _1 (po)j there exists a continuous 

/ : M' X 

with /( 2 / 0 ) = a- 0 and / = tto/ if and only if / # (7n(Af', y 0 )) C 7r # (7ri(X, x 0 )). 
/ is unique if it exists. 

Let 7r : Af — > Af be the universal covering of Af. A decfc transformation is 
a homeomorphism ip : Af — > Af with 


7T = 7T O (p. 

Let 7r(a;o) = po- iri(M,po) then bijectively corresponds to 7r _1 (p 0 )- More 
precisely, x\ £ tt^ 1 ^) corresponds to the homotopy class of 77(7^), where 
j Xl : [0, 1] — > Af is any path with 7 Xl (0) = xq,^ Xi (1) = X\. The deck trans- 
formations form a group that acts simply transitively on 7r _1 (p 0 ), and associ- 
ating to a deck transformation <p(xo) £ tt^^po) then yields an isomorphism 
between the group of deck transformations and 7Ti(Af,po). 
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if 


If M and N are manifolds with universal coverings M and N, resp., and 

/ : M -*• N 


is a continuous map, we consider the induced homomorphism 


P ■= ft ■ TTl (M,p) -> 7Tl(IV, f(p)) 

of fundamental groups. If 7 r : M — > M is the universal covering, we can lift 
f o tt : M — > N to a map 

/ : M — > TV, 

because the above lifting condition is trivially satisfied as 7 Ti(M) = {1}. / 
is equivariant w.r.t. the above homomorphism p in the sense that for every 
A £ acting as a deck transformation on M, we have 

f(Xx)=p(X)f(x) for every x € M, ( B.l ) 

where p( A) acts as a deck transformation on N. We say that / is a p- 
equivariant map between the universal covers M and N. 

Conversely, given any homomorphism 

p : n 1 (M, p) -> 7Ti (N,q) 


and any p-equivariant map 

g : M -> N (with g(p) = q), 
not necessarily continuous, then g induces a map 

g' : M N 

whose lift to universal covers is g. g' is continuous if g is. 

Finally, if M is the universal cover of a compact Riemannian manifold 
M, a so-called fundamental domain F(M) for M in M can be constructed 
as follows: 

For simplicity of notation, we denote the group ’K\{M,Xq) operating by 
deck transformations on M by C, and its trivial element by e. 

Let d(.,.) be the Riemannian distance function on M. We select any 
zq £ M. We then put 

F(M) := {z £ M : d(z , zo) < d(jz, zq) for all 7 £ C, 7 ^ e}. 

F(M) is open. Since r operates by isometries, i.e. 

d(Xzi, XZ 2 ) = d(zi,z 2 ) for all X £ T, zi, z 2 £ M, 
we may also write 

F(M) = {z £ M : d(z, zq) < d(z, Azq) for all Ae f,A/ e}. 
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By its definition, F{M) cannot contain any two points that are equivalent 
under the operation of r. On the other hand, for any 2 G M, we may find 
some fi G T such that 

pz G F(M). 

Thus, the closure of F(M ) contains at least one point from every orbit of r 
in M. 

If / : M — > M is an integrable function, and if / : M — > R. is its lift to the 
universal cover of M, then 

J f (x)dvol(x) = j /(y)dvol(y). 

M F(M) 

Examples of fundamental groups: 

1) 7Ti(R") = {1} for all n. 

2) ir 1 (S 1 ) = Z. 

A generator is given by 

c: [0,1]^ 5 1 = {(*, 1 ,) GR 2 :* 2 + y 2 = l} 
c(t) = (cos 2tt t, sin 2nt) . 

The universal covering of S 1 is R 1 , and the covering map is likewise 
given by 

7r (t) = (cos 27rt, sin 27rt). 

3) 7Ti(S'”) = {1} for n > 2. 

4) 7Ti(SO(n)) = Z 2 for n > 3. 

The preceding results can be found in any reasonable textbook on Alge- 
braic Topology, for example in [92] or [231]. 
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